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Abstract. The top quark is the heaviest of the six quarks of the Standard Model. Precise knowledge of its mass is 
important for imposing constraints on a number of physics processes, including interactions of the as yet unobserved 
Higgs boson. The Higgs boson is the only missing particle of the Standard Model, central to the electrowcak symmetry 
breaking mechanism and generation of particle masses, in this Review, experimental measurements of the top quark 
mass accomplished at the Tevatron, a proton-antiproton collider located at the Fermi National Accelerator Laboratory, 
are described. Topologies of top quark events and methods used to separate signal events from background sources 
are discussed. Data analysis techniques used to extract information about the top mass value are reviewed. The 
combination of several most precise measurements performed with the two Tevatron particle detectors, CDF and D0, 
yields a value of Mt = 173.2 ± 0.9 GeV/c^. 
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1. Introduction 

The Standard Model of Particle Physics unifies the weak 
and electromagnetic forces into a single quantum field 
theory. The addition of Quantum Chromodynamics 
(QCD), which describes the strong interactions that 
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bind quarks into protons and neutrons, completes the 
Standard Model (SM). The elements of this unified 
theory arc six quarks, six leptons and five gauge bosons. 
The gauge bosons are the and Z (carriers of the weak 
force), the photon (carrier of the electromagnetic force) 
and the gluon (carrier of the strong force). An additional 
neutral scalar boson, the Higgs boson, is necessary to 
explain electroweak (EWK) symmetry breaking, i.e., the 
observation of non-zero masses of the and Z bosons. 
It also generates quark and lepton masses through the 
Yukawa interaction. A recent review of EWK symmetry 
breaking scenarios can be found in an earlier issue of this 
journal [1]. 

The top quark is the heaviest fundamental fermion. 

Prior to its direct observation, its mass was predicted 
through a fit to a number of EWK observables sensitive 
to virtual top quark effects. This prediction, however, 
had a very large uncertainty (for historical details, see a 
plot of top mass expectations and measurements versus 
time in [2]). The mass of the still unobserved Higgs 
boson, Mfj. is related within the electroweak theory to 
the W boson mass, Mw, and the top quark mass, Mt, 
through quantum loop corrections. Some of the lowest 
order diagrams that link M-w, M^, and Mh are shown 
in Figure 1. 




Figure 1. Lowest order diagrams that 
correlate M-yy, Mt, and Mjj. 



Precision measurements of the masses of the W 
boson and the top quark are essential to predict the mass 
of the Higgs boson. An overall fit of EWK observables 
including the W and the top masses can put constraints 
on the Higgs mass [3] . Figure 2 illustrates the relationship 
between the three masses, given current measurements. 
The Standard Model fit of 18 EWK observables (without 
the mass measurements) constrains the Higgs mass to lie 
inside the dashed contour, while the precision with which 
the W and top masses are currently known constrains the 
Higgs mass to the smaller solid contour. From the latter 
we see that a change of 1 GeV/c^ in the top mass shifts 
the predicted central value of the Higgs mass by ~ 10 
GeV/c2. 

The discovery of the bottom quark in 1977 [4] set in 
motion the search for its partner in the third fermion 
doublet. Experimental lower limits on the top mass 
slowly increased from a few GeV/c^ until the top quark 
was observed and its mass was directly measured at the 
Tcivatron 18 yciars later [5, 6]. A first hint for the top 
quark was reported by the CDF collaboration in [7], 
together with a mass value of 174 ± 10 ± 13 GeV/c^. 
Today, the measured value of the top quark mass is 
not very far from this very early estimate. Increased 
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Figure 2. The relationship between M^, 
Mt, and Mjj. For each value of Mjj, the 
Standard Model constraints possible values 
of and Mt so that they have to lie 

along the corresponding diagonal band. The 
dashed contour is the indirect constraint on 
M-\Y and Mt from measurements of 18 EWK 
observables; the solid contour is the expectation 
from the M\y and Mt direct measurements. All 
contours are for the 68% CL fit result [3]. 

statistics, better understanding of detector performance, 
and better measurement techniques have reduced the 
uncertainty considerably. 

The top quark is much heavier than its partner, the 
bottom quark, whose mass is about 5 GeV/c^ (see [8] for 
a review on quark masses). The Yukawa coupHng of the 
top quark. At = 2'^/'* G^J."^ Mt, is of order unity. This raises 
the question if the top quark is distinct from the other 
quarks, i.e., does it have a special role in the electroweak 
symmetry breaking? A dynamical breaking of EWK 
theory by a top quark condensate was proposed even 
before the top quark was discovered [9] , later extended to 
a topcolor model [10]. So far no experimental evidence 
for the validity of such a model has been found. 

1.1. Top mass definition 

When referring to quark masses, it is important to define 
which theoretical framework is used for the given value 
of the mass. For example, in the overall fit of electroweak 
measurements the top quark mass needs to be expressed 
in the M S renormalization scheme. It is not completely 
clear how to relate the mass measured in tt production 
experiments with the mass used in the EWK fit. It is 
normally assumed that what is being measured is the 
pole mass, Afpoic- The relation between Mpoio and 
the mass in the MS scheme, Mjfg, can be computed 
within perturbative QCD. Using approximate next-to- 
next-to-leading order (NNLO) calculations, the difference 



is about 10 GeV/c^, the MS mass being smaller [11, 12]. 

The top quark mass can be determined from a 
measurement of the total production cross section. The 
cross section dependence on the mass can be calculated 
in any renormalization scheme, and the results for the 
MS scheme are given in [11]. The D0 collaboration 
has extracted a top mass in the MS scheme from such 
a measurement [13], but the precision of this method 
is not comparable with that achieved by direct top 
mass measurements from the top decay products, with 
a subsequent change from Mpoie to Mj^. 

A number of theoretical questions arise in relating 
the mass measured directly to the pole mass [12, 14]. 
First, the pole mass is sensitive to an infrared renormalon, 
which implies that the value of the pole mass is modified 
by an amount of the order of Aqcd as the order in 
perturbation theory is changed [15]. Second, there are 
some doubts as to the precise definition of the measured 
mass. The direct measurements reported here are all 
calibrated with Monte Carlo generators, therefore what 
is measured is the mass parameter used in the generators, 
Mmc- The relationship between Mpoio and Mmc can be 
represented in the general form [12, 16]: 

Mpoic = Mmc + Qo[as(Qo)ci + •••], 

where the coefficient ci is not known (it depends on 
parton shower implementation details in each particular 
generator), but likely to be of the order unity. The main 
question is what the appropriate value of the scale Qq 
should be. It has been argued [12] that the cutoff on 
radiation in the parton shower evolution of order 1 GeV 
employed by generators like PYTHIA [17] implies that 
Qo is of order 1 GeV as well. The difference between 
the measured mass and the pole mass would then be 
0(1 GeV/c^). Furthermore, it is not clear if other 
approximations used in the parton shower development 
alter the resulting mass, as discussed in Appendix C 
of [16]. Theoretical studies of this problem are in progress 
along the lines presented in [18]. It is expected that the 
relation between Mmc and Mpoic will be understood in 
the not too distant future. 

1.2. Notation 

In the subsequent text, we refer to a number of kinematic 
and physics quantities. The most common ones are 
denoted by the following symbols: 

•q — Pseudorapidity which characterizes direction of 
a vector {e.g., particle momentum) with respect to the 
colliding beam axis (the z axis). It is related to the vector 
polar angle, 9,hy r] = — In [tan(0/2)]. 

(j) — Azimuthal angle of a vector. 

Ai? — Distance in the rj-cj) space: Ai? = 
■y/ lS.rf + A(/)2. Equation Ai? < c, with respect to 
a certain direction and with some constant c, defines 
a circle in the rj-cj) space. In the context of jet 
reconstruction algorithms, such circle is usually referred 
to as a "cone". 
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Pt — Transverse momentum; pT — psmO, where p 
is the momentum magnitude. 

Ej- — Transverse energy, usually defined by Et ~ 
Es'mO, where E is the particle energy. 

— Missing transverse energy. It approximates 
the transverse momentum carried away by neutrinos in 
the assumption of a fully hermetic particle detector. 

Hx — Scalar sum of the Et of all charged leptons 
and jets in the event added to the fix- 

Mt — Experimentally measured mass of the top 
quark. Unless noted otherwise, Mt = Mmc- 

mt — An estimate of the top quark mass obtained 
in a single collider event. 

J Cdt — Integrated luminosity accumulated by an 
experiment. C is the instantaneous luminosity. 

2. Top production and decay 

A precision measurement of the top quark mass relies 
heavily on the SM predictions for top production and 
decay processes. It is therefore essential to validate 
underlying physics and detector response models by 
comparing experimental measurements with theoretical 
expectations. Any deviations found will have to be 
understood and properly represented in the models. 
Effects not taken into account explicitly are treated 
as sources of systematics uncertainties (as discussed in 
Section 6). 

2.1. Cross section for tt production 

The production of top-antitop pairs is a process that can 
be calculated in perturbative QCD. Figure 3 illustrates 
some of the leading order (LO) QCD diagrams that 
contribute to tt production. At the Tevatron, the process 
is dominated by the quark annihilation diagram (leftmost 
in the figure), whereas the gluon fusion diagrams 
contribute only 15% of the total cross section [19, 20]. 
In contrast, at LHC energies the gluon diagrams are 
expected to contribute more than the quark diagrams. 
The top cross section measured by CDF at the Tevatron 

g 'VWasaassw) » t 



of this writing. It is in a good agreement with the SM 
calculations. 





Figure 3. Some of the Feynman diagrams that 
contribute to tt production. 

[y/ s — 1.96 GeV/c^) as a function of the top quark mass is 
shown in Figure 4 [21]. The calculations were performed 
at NLO+NLL (next-to-leading log) order (Cacciari et 
al.) [22], at NLO-hNNLL (next-to-next-to-leading log) at 
NLO and approximate NNLO by (Langenfeld et al.) [11] 
and by (Kidonakis et al.) [23]. The measurement point 
is placed at Mt = 172.5 GcV /c? because this mass value 
was used to model tt production [24]. This measurement, 
att — 7.50 ± 0.48 pb, is the most precise at the time 




CDF Run II (4.6 tb-^) 
Langenfeld, Moch & Uwer, arXiv:0906.5273 (2009) 
Cacciari et al., arXiv:0804.2800 (2008) 
Kidonakis & Vogt, arXiv:0805.3844 (2008) 
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Figure 4. Top cross section vs. top quark 
mass as calculated by different authors [11, 
22, 23]. For each computation, the middle 
curve shows the central value for the theoretical 
prediction, while the upper and lower curves 
show the ztlo" computations respectively; the 
curves from the last two authors are very 
similar. The data point [21] is the average 
of two CDF measurements which use the Z 
boson production cross section to eliminate the 
luminosity systematics [24]. 



Measured properties of tt events are in agreement 
with SM expectations (see recent reviews of Tevatron 
top-quark physics results [25, 26] for more details). Latest 
confirmations include the correlation between the spins of 
the top and antitop quarks [27, 28] , the fraction of gluon- 
gluon contribution to the tt production cross section [29] , 
and measurements of the tt differential cross sections [30, 
31] (see Section 4 for a plot). One known exception 
is the forward-backward asymmetry in top and antitop 
directions. Recent measurements by both CDF [32] and 
D0 [33] differ by ~ 3 standard deviations from the 
SM expectation calculated at NLO [34] and approximate 
NNLO [35]. This anomalous effect is currently under 
further investigation. 

2.2. Top decay modes 

Within the Standard Model, the expected top quark 
width is ^ 1.3 GeV and its lifetime is about 0.5 x 10~^^ 
seconds. With such a short lifetime (<C 1/Aqcd), top 
quark decays before hadronizing. Observation of its 
decay products allows for a direct measurement of its 
mass, a unique feature among quarks [8]. The top quark 
is expected to decay almost exclusively into a W boson 
and a bottom quark, so that the intermediate state of 
a tt event is W^bW~b. Each of the two W bosons 
decays further into a charged lepton and a neutrino or 
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into a quark-antiquark pair (multiple flavor and color 
assignments are possible), as illustrated in Figure 5. The 
quarks produced in W decays are not observed as such, 
but they hadronize producing jets of particles which are 
subsequently observed and measured in the detectors. At 
leading order, 10.6% of the events will have two charged 
leptons (e, /Lt, r), two neutrinos and two jets; 43.9% of the 
final states will have one charged lepton, one neutrino 
and four jets, and 45.5% will have six jets. Neutrinos 
escape direct detection but their presence can be inferred 
via an excess of missing transverse energy in the event. 
While W — ^ tvg decays, with i = e or fi, result in 
unambiguous experimental signatures, the case W — tj/.^ 
is more complicated as tau leptons subsequently decay in 
a variety of ways. According to the number of electrons 
and muons produced in the W decay chains, the final 
states ( "topologies" ) of the ti system that lead to distinct 
experimental signatures are called "dilepton","^+jets", 
and "all-hadronic" . Taking into account ~ 35% leptonic 
decay branching fraction of the t, only 6.4% of the events 
end up in the dilepton category. Similar considerations 
affect the £+jets topology which accounts for 34.1% of 
the events when hadronic r decays are excluded. 
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Figure 5. Final states of the tt system. 

Measurements of branching fractions of top quark 
decays [36, 37] have confirmed that the top quark decays 
predominantly into Wb, as predicted by the SM. A 
number of decay modes of the top quark have been 
searched for and excluded: decay to a charged Higgs 
boson (at the level of 10%) and decays through Flavor 
Changing Neutral Currents (at the level of 10^''), as 
summarized in the recent reviews [25, 26]. Other 
properties of top quarks and their decays, such as W 
boson helicity fractions [38, 39], top charge [40], and 
top width [41, 42], have been studied as well, and no 
significant deviations from the SM expectations have 
been found. 

3. Identifying top events 

Stringent requirements are imposed on detectors used for 
precision measurements of the top quark mass. High 



resolution charged particle tracking together with high 
granularity, precise energy determination in calorimeters 
is necessary for both lepton and jet measurements. 
Muon momentum measurements and identification are 
accomplished, in addition to good tracking, with 
dedicated muon systems located outside the calorimeters. 
Jet energy measurements require precision calorimetry 
and good segmentation, as jet shapes play an important 
role both in the fJx measurement and in disambiguation 
between quark and gluon jets (see Section 7.2). 
Hermeticity is essential for the measurement. High 
resolution tracking is also needed to identify bottom jets, 
in particular by reconstructing secondary vertices in jets. 
Backgrounds to tt production are considerably reduced 
by requiring 6-jet presence in the event (see Section 3.5). 

3.1. Detectors 

The CDF detector took its first data in 1985, whereas 
the D0 detector took its first data in 1992. The top 
quark was discovered in 1995 and the first direct top mass 
measurements were made with the initial configuration 
of these detectors (Tevatron Run I measurements). Both 
the detectors and the Tevatron accelerator complex went 
through major upgrades during the 2000-2001 shutdown. 
Precision measurements of the top mass were performed 
with the upgraded detectors installed for Run II at the 
Tevatron. 

The Run II CDF detector is shown in Figure 6 [43]. 
Precision tracking is achieved with: large number of 




Figure 6. The CDF detector componenents 
from the coUision point outward: the silicon 
system (purple), the central outer tracker 
(inner part in orange), the solenoid (yellow), 
the electromagnetic (red) and hadronic (blue) 
calorimeters, and the the muon chambers 
(green) [43]. 

points along the track, high precision space point mea- 
surements, long lever arm, and closeness to the produc- 
tion vertex for precision impact parameter determina- 
tion. Placement of silicon detectors in close vicinity of 
the beam pipe has allowed CDF to achieve a single- 
particle transverse momentum determination precision 
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'^^^ = 0.07%pt when the primary vertex is included in 
the track fit [44] . The CDF tracking system consists of a 
sihcon microstrip detector and an open-cell drift chamber 
immersed in a 1.4 T solenoidal magnetic field. The silicon 
detector (SVXII) consists of five double sided cylindrical 
layers of detectors augmented by an inner layer mounted 
on the beam pipe and two more layers on the outside. 
They provide precise charged particle tracking in the ra- 
dial range from 1.5 to 28 cm. Outside this region, the 
Central Outer Tracker (COT) extends the tracking sys- 
tem to 137 cm providing 96 additional points for track 
reconstruction. In combination, the COT and the silicon 
detectors provide excellent tracking up to a pseudorapid- 
ity \ri\ ^ 1.0. Additional layers of silicon detectors extend 
tracking coverage to |?7| ~ 2. 

The central electromagnetic (CEM) calorimeter 
contains Xq > 18 radiation lengths of lead-scintillator 
layers. Proportional chambers embedded at a depth 
of ~ 6Xo (shower maximum) provide shower shape 
information for electron identification. The resolution 
of the CEM calorimeter for electron measurements 
is '^^Et'' ~ ® 1-5%. Projective geometry is 

used: dense segmentation in the CEM, coarser in the 
hadronic calorimeter. The central hadronic calorimeter 
is composed of alternating layers of iron plates and 
scintillators for a total of 4 nuclear interaction lengths. 
This gives a resolution for charged pions ^^^^ — © 
3%. In the forward region (1< \ri\ < 3.6) a tile calorimeter 
is used, with an electron resolution = y=© 1-0% 
for the electromagnetic component and a pion resolution 
^^^B^ = ® 5% for the hadronic component. An 
additional hadronic calorimeter covers the region between 
the the central calorimeter and the plug calorimeter, thus 
providing hermeticity of the detector. Its resolution is 
"^^^■^ = © 4% for charged pions that do not interact 
in the CEM [45]. 

The muon chambers coverage extends only to \ri\ < 
1.0. In the center, two muon detectors cover the jyyj < 
0.6 region. They consist of four layers of proportional 
chambers each, with the magnet return yoke in between 
them, thus providing an additional 60 cm of steel 
absorber. An additional set of four layers of drift 
chambers cover the region 0.6 < \r]\ < 1.0. 

The D0 detector is shown schematically in Fig- 
ure 7 [46, 47]. The tracking system consists of a silicon 
microstrip tracker and a central fiber tracker inside a 2 
T solenoid. The silicon detector includes four layers of 
single and double sided detectors in the central region. 
In 2006 an additional layer of silicon sensors was added 
on the beam pipe (the data before and after this addition 
are identified as Run Ila and Run lib). In the forward 
direction disks equipped with silicon detectors complete 
the tracking system. Outside of the silicon detector sys- 
tem, 16 layers of scintillating fibers (8 axial and 8 stereo), 
placed in the radial region 20-52 cm, provide additional 
tracking information. The combination of the tracking 
devices provides efficient tracking up to pseudorapidity 
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Figure 7. The D0 detector [46]. 

< 3. Tracking resolution is = 0.2%pT © 1-4%. 

A preshower detector is placed outside of the magnet 
and inside the calorimeter. The D0 calorimeter is 
a Uranium-Liquid-Argon system inside a cryostat. It 
consists of a central and two endcap components. The 
EM part has a depth of ~ 20 radiation lengths. The 
energy resolution for electrons is ^^^^ = y= © 4% in 
the central calorimeter. In the End Cup calorimeter 
the resolution is ^ = ^ © 4% [48]. The depth of 
the hadronic section varies from 7.2 nuclear interaction 
lengths (A) at [ryl = to A = 10.3 at \ri\ = 1. The 
calorimeter extends to \ri\ — 4 with a region of low 
efficiency at 1.0 < \r]\ < 1.4. 

The muon system surrounds the calorimeter and 
consists of tracking detectors and scintillators covering 
a pseudorapidity region up to \ri\ = 1.0. A toroidal 1.7 T 
iron magnet completes the central muon system. The 
forward muon system covers the region 1.0 < < 2.0. It 
consists of mini drift chambers and two toroidal magnets 
with 1.6 T average field. 

Both CDF and D0 use a three-level trigger system 
to select the events to analyze. The Tevatron delivers 
collisions to the detectors every 396 ns, which means, at 
the present luminosity (^ 3x 10'^^ cm~^ s~^), about 2.5 x 
10^ bunch crossings per second, with an average of five 
collisions per crossing. The first level hardware trigger 
reduces the event rate to - 10 kHz/2 kHz (CDF/D0). 
The second level uses trigger processors that reduce the 
rate to 200 Hz/1000 Hz (CDF/D0). Finally, the third 
level is based on limited event reconstruction that reduces 
the rate to 40 Hz/50 Hz (CDF/D0). Digitized detector 
readouts for the accepted events are stored on tape for 
subsequent offline analysis. 

The two detectors have adequate capabilities to 
perform many precision measurements as well as to 
explore the vast physics landscape at hand. 

3.2. Lepton identification 

Electron, muon and neutrino (-^t) reconstruction utilizes 
event information from all detector subsystems. Elec- 
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trons are identified using the tracking system (pt), the 
electromagnetic calorimeter {Et), the electromagnetic 
shower shape, and the information from the hadronic 
calorimeter. CDF obtains the shower shape informa- 
tion from proportional chambers located at shower max- 
imum [43], whereas D0 uses information from finely 
segmented layers in the liquid argon calorimeter [49]. 
Electrons that are likely to come from photon conver- 
sions arc removed using appropriate algorithms. Both 
experiments have requirements on the E/p ratio to re- 
duce backgrounds from QCD jets. For electrons com- 
ing from W decays, an isolation requirement is imposed 
after all other information is processed. This require- 
ment, which helps in rejecting jets faking electrons, con- 
sists in vetoing significant additional energy in a cone 
of Ai? = 0.4 radius around the electron direction. For 
both experiments, there are several levels of electron cri- 
teria: "tight" , "medium" , and "loose" , but most of the 
top mass measurements use the "tight" electron require- 
ments. Both experiments use Z e+e~ decays to cali- 
brate the electron energy. 

Muons are identified using information from the 
muon chambers, the tracking system, and the calorime- 
ters. Having found a signal in the muon chambers, a 
"muon stub", i.e., a track segment, is reconstructed. 
Next step is to match this segment to a track found in the 
tracking system that extrapolates to the muon chambers 
within a small distance of the segment. This distance is 
different for the different components of the muon sys- 
tem for both CDF [43] and D0 [49], as it depends on the 
resolutions of the chambers and the tracking system as 
well as on the distance between them. The track is also 
required to originate from the event primary vertex. The 
energy deposited in the EM and hadronic calorimeters 
by the muon is subject to requirements that are different 
for the two experiments. For muons produced in W de- 
cays, a calorimeter isolation as well as a track isolation is 
required in both experiments. Muons from cosmic rays 
are removed. The absolute energy calibration is obtained 
with Z — > jJL^ ijT decays. 

The missing transverse energy, ^t, is determined 
from the transverse momentum imbalance in the event. 
It is calculated by adding vectorially all the calorimeter 
towers, with the direction defined by the vector 
connecting the primary vertex to the center of the tower. 
Corrections to this sum are made for jets and muons 
in the event. The muon correction is obtained by 
subtracting the energy deposited by the muon in the 
calorimeter and by adding its track momentum to the 
vectorial sum of the tower energies. The energy of the 
jets in the event is corrected by various coefficients, 
as described in the next section, and the vector is 
readjusted to take this correction into account. 

3.3. Jet reconstruction and calibration 

Hadronic jets in tt events are reconstructed from the 
energy flow data collected by the CDF and D0 respective 



calorimeters. Initially, all energy depositions are grouped 
into projective "towers" whose size is consistent with the 
hadronic calorimeter granularity. All towers associated 
with identified electrons are excluded, electronic noise 
is subtracted. The remaining towers are clustered 
into jets using seeded variations of the iterative cone 
algorithm [50] (which is known as the "mean shift" 
algorithm [51] in the pattern recognition literature). 
Although a number of deficiencies have been discovered 
in the cone-based approach to jet reconstruction [52, 
53], extensive studies of various particle processes 
performed with this algorithm resulted over time in 
good understanding of jet properties and in consistent 
calibrations. Fixed jet size permits a very simple 
correction for the presence of underlying event and 
multiple interactions occurring in a single bunch crossing. 

The r^-cj) cone radius used by CDF to reconstruct jets 
in tt events, Ai? = 0.4, was chosen to optimize efficiency 
and to minimize energy sharing among different jets. The 
CDF jet energy resolution is approximately (t(£'t) = 
(0.1 {Et/GcN) + 1.0) GcV for a 0.7 cone radius [45]. 
The detector response is simulated with the GFLASH 
parametrization [54] interfaced with GEANT3 [55]. 
Test beam data, in addition to pion and electron 
measurements from collider data [56], are used to tune 
the GFLASH parameters. The E/p distributions at 
several energies for pions and electrons are compared 
to the Monte Carlo expectation to obtain the tuning 
parameters. The jet energy scale is set by the 50 GeV/c 
test beam data point for pions and by the Z — > e+e" 
collider data for electrons. The jets are corrected for non- 
linear response of the calorimeter; poorly instrumented 
regions; "multiple interactions", i.e., the extra energy 
from the additional collisions occurring in the same bunch 
crossing (pile- up); "underlying event", i.e., the energy 
from the pp remnants; and "out-of-cone" energy. After 
tuning the simulation to the individual particle response, 
an "absolute" jet correction is derived from simulation. 
This correction is obtained by comparing the jet pT at 
the particle level to the jet pt after simulation. The 
same cone algorithm is used in both cases. The derived 
correction is valid if the number of tracks and the pT 
distribution of the tracks in data and in simulations 
are the same. This was verified using PYTHIA Monte 
Carlo (MC) samples with jet px up to 600 GeV/c. The 
uncertainty of this procedure contributes 1% to the jet 
systematic uncertainty, independent of pt- Calibration, 
done in the central part of the detector (0.2 < \r]\ < 
0.6), is extended to all the rj regions using a dijet 
imbalance algorithm. Systematic uncertainties arise from 
uncertainties on the corrections, stability of calorimeter 
response, calorimeter simulation for EM particles, and 
calorimeter simulation for hadrons. The different 
components that contribute to the jet energy scale (JES) 
uncertainty are shown in Figure 8. The corrections 
and systematics have been validated using several data 
samples: dijet, 7+jets, Z+jets, and with hadronic W 
boson decays in tt events [56] . 
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Figure 8. Systematic uncertainties on ttie jet 
energy scale for the CDF experiment [56]. 



The D0 jet energy resolution is a{ET) = 9% at 
100 GeV for a 0.7 cone radius. The Pt dependence 

PT V pt PT 



C, where 



is parameterized as 

iV determines the magnitude of the noise term, S is 
the stochastic term, and C is a constant. The three 
parameters are evaluated at several rj intervals [57]. D0 
uses the "midpoint" version of the cone algorithm with 
a radius of 0.5 for top analyses. The multiple interaction 
correction is determined from events taken with a zero 
bias trigger during physics data taking. It accounts 
for noise and energy pile-up. Additional corrections for 
muons in jets and for radiation outside of the cone are 
also made. The jet response for jets with rj < 0.4 is 
derived from a high statistics 7-l-jets sample, using the 
Pt imbalance of these events. The photon energy scale is 
assumed to be the same as that of the electrons calibrated 
with Z — > e~^e~ collider data. The extension beyond 
r] — 0.4 is obtained by px imbalance of dijet events, with 
one jet in the ij < 0.4 region. 

The methods employed for the top mass measure- 
ments use jets of particles obtained with Monte Carlo 
generators followed by a GEANT-based simulation [55]. 
The jet energy scale is evaluated in the data and in the 
simulation. The differences are found to be independent 
from jet px and r] and are taken into account in the sys- 
tematic uncertainties [58, 59]. Figure 9 shows the major 
components of the systematic uncertainties. 

Both CDF and D0 use hadronic W decays to 
determine an overall value of the jet energy scale from the 
tt data, as will be described in more detail in Sections 6 
and 7. 

3.4- Tagging b jets 

One very important ingredient of precision top mass 
measurements is identification (tagging) of heavy flavor 
jets. It is essential for suppressing backgrounds from 
QCD processes as well as for reducing the combinatorics 
in top events with high jet multiplicity. 
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Figure 9. Systematic uncertainties on the jet 
energy scale for the D0 experiment [46]. 



Tagging of heavy flavor jets is done in several ways: 
(a) by identifying muons or electrons originating from 
bottom or charm hadron decays within the jet; (b) by 
reconstructing a secondary vertex in jets, due to the 
decay of a heavy flavor hadrons; (c) by identifying tracks 
within the jet that originate away from the primary 
vertex. Most of the top mass measurements performed 
at the Tevatron use methods (b) and (c) for 6-jet 
identification, i.e., use the Lxy distance of the secondary 
vertex or the impact parameter (do) of each track, as 
illustrated in Figure 10. 



Displaced 
Tracks 




Figure 10. Secondary vertex reconstruction. 
Lxy is the distance of the secondary vertex 
from the primary vertex in the plane orthogonal 
to the proton beam direction. The impact 
parameter of a track is marked as do. 

The impact parameter resolution of the CDF 
tracking system is 35 /xm for tracks with pT > 
2 GeV/c [44]. D0 tracking provides impact parameter 
measurements with respect to the primary vertex with 
a precision between 20 and 50 /im, depending on 
the number of hits in the silicon detector [60]. The 
reconstruction of the secondary vertex follows a similar 
procedure for the CDF [61] and D0 [60] algorithms. A 
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primary vertex for the event is reconstructed using tracks 
with a small impact parameter significance, do/adg < 
3. The tracks in jets within a cone of Ai? < 0.4 
(CDF) or 0.5 (D0) and with large impact parameter 
significance do/cTdo > 3.5 are used to find a secondary 
vertex. Additional criteria are set for the significance of 
the Lxy measurement. A jet is tagged as a heavy-flavor 
jet if all criteria, including a good fit of the secondary 
vertex, are met. The probability that the displaced vertex 
is produced by a charm jet is also evaluated, assessing 
the invariant mass of the tracks that form the secondary 
vertex. Both MC and data samples are used to evaluate 
the efficiency of the tagger in top events and the mistag 
rate, i.e., the probability that a light quark jet is tagged 
as a heavy flavor jet. The CDF &-tagging efficiency as 
a function of jet Et is shown in Figure 11 [62]. The 
efficiency for a "tight" 6-tagging requirement used for the 
top mass measurements is about 40%. The mistag rate 
is increasing from 0.4% at 15 GeV to 2.3% at 180 GeV. 

SecVtx Tag Efficiency for Top b-Jets 
0.7, 



jets in two different regions of px and rj. 



0.6 

0.5; 
0.4: 
0.3: 

0.2 
0.1 





I Loose SecVtx : 
I Tight SecVtx : 



20 40 60 80 100 120 140 160 180 200 

Jet Et (GeV) 

Figure 11. The CDF b-tag efBciency for 
"tight" and "loose" 6-tagging requirements [62] . 
The average Ex of b jets in tt events produced 
at the Tevatron is ~ 70 GeV. 

Both CDF and D0 have developed more sophisti- 
cated algorithms for 6-tagging. They use more informa- 
tion than just the secondary vertex fit result. These neu- 
ral network (NN) algorithms help reduce the background 
from light quark jets that mimic heavy fiavor jets. The 
CDF algorithm [63] employs other variables in addition 
to secondary vertex information, but has not been used 
for top mass measurements. The D0 NN algorithm [64] 
was used for top mass measurements. It takes into ac- 
count information from impact parameter measurements, 
the secondary vertex tagger, and the soft lepton tagger 
(SLT). The SLT attempts to identify semileptonic decays 
within a jet. Separate efficiencies for b and c quark jets as 
a function of px and 77 are obtained, and the mistag rate 
as a function of the same variables is evaluated [64] . The 
NN is trained on QCD bb jets and on light quark Monte 
Carlo samples, and its performance is measured in data. 
Figure 12 shows the fake (mistag) rate-vs-efficiency for 
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Figure 12. D0 6-tag efficiency vs. 
for two px and 77 regions [60]. 



fake rate 



S.5. Major background sources 

There are two substantially different types of back- 
grounds: physics processes that have topologies similar 
to tt events and instrumental backgrounds originating 
from processes that have incompatible final states but 
can nevertheless appear top-like due to detector imper- 
fections. Physics backgrounds are difficult to eliminate 
completely, but their rates can be usually reduced by im- 
posing requirements that exploit the differences between 
signal and background kinematic distributions. JVIodeling 
of physics backgrounds is discussed in Section 4.2. In- 
strumental backgrounds are caused by misidentification 
of charged leptons, neutrinos {fix), or 6 jets. These back- 
grounds can be reduced by stringent requirements on par- 
ticle identification criteria (charged leptons), topological 
cuts for the fix (neutrinos) , and sophisticated algorithms 
for 6-tagging. 

Backgrounds are different for the three major 
topologies in which the tt candidate events are found. 
A brief description of the major backgrounds in each of 
these topologies is provided below. Additional details 
will be given in Section 7, where the actual top mass 
measurements are discussed. 

The background in the dilepton topology (c and /^) 
is mostly due to Drell-Yan processes {Z/j* i^i^)', 
diboson production: WW and WZ where a lepton from 
the Z decay is lost and appears as fix (oppositely charged 
leptons of the same type coming from Z decays are 
rejected with a Z mass veto); W+ > 2 jets (where 
one jet mimics an electron or a muon); single top 
production (lepton -|-> 3 jets where one jet mimics 
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an electron). In addition, "fake" muons can originate 
from punch-through or from hadrons decaying in flight. 
Details of instrumental backgrounds for electron and 
muon identification can be found in [65, 66]. The ti 
decay chains with a r lepton are included in the f+jets 
or dilepton channels only if the r decays leptonically (in 
~ 35% of the cases). Studies of the processes involving 
the T lepton are discussed in more detail in Section 7.3. 

The major background in the ^-|-jets topology is 
VF+jets production which has a much larger cross section 
than ti production. This background is further classified 
into TV+light flavor and VF+heavy flavor. The first one 
can be reduced considerably by requiring one or more 
jets to be tagged as a 6 jet. Another large background 
(often called non-VF QCD) is due to multijet production, 
where a jet fakes an electron or a muon. Rate estimations 
for these backgrounds are performed with data driven 
methods, developed for top cross section measurements. 
Details can be found in [67, 24] for the CDF analyses and 
in [64] for D0 analyses. A good review of these methods 
can be found in [25] . Other instrumental backgrounds are 
due to mismeasurements of jets which occur in poorly 
instrumented regions of the detector and increase the 
value of f^T- Requiring a certain minimum angle between 
the jet and the direction helps to suppress these 
backgrounds. Finally, diboson production, i.e., WW, 
W Z and ZZ final states with the same topology as tt, and 
single top production also contribute to the background. 
The dependence of the background rate on the number 
of ^-tagged jets can be seen in Figure 13. See Section 7.1 
for the contributions of each of these backgrounds to the 
candidate event samples after application of all selection 
requirements. 



which combines a number of discriminating variables, are 
used to suppress the background further, as described in 
Section 7.2. 

Table 1 shows the signal-to-background ratio (S/B) 
for simple event selection criteria as a rough estimate. 
For the dilepton and -^-l-jets channels: Exileptons) > 
20 GeV, fix > 20 GeV, jets with Et > 20 GeV are 
required; for the all-hadronic final state Et > 15 GeV for 
the 6-8 jets in the event is required. The values of S/B 
for the ^-|-jets channel can be improved using additional 
kinematics requirements, similar to those used for the 
all-hadronic topology. See Section 7 for the background 
rates achieved in various analyses. 

[hbtp] 

In summary, the ^-fjets channel is the most 
propitious for measuring the mass of the top quark at 
the Tevatron. With the 6-tagging requirement, there 
are three signatures for signal identification (a charged 
lepton, large fix, b hadrons in the event). With the 
appropriate selection, the S/B ratio can be reasonably 
high, and from the branching ratios 34% of tt events 
are expected to manifest themselves as e, /i + jets. 
In the dilepton channel there are fewer events but the 
S/B ratio is good even without 6-tagging. Tagging 
of 6-jets can help increase the S/B ratio even further. 
Finally, the top mass measurement in the the all- 
hadronic channel, with optimal choice of variables used 
for background discrimination, has recently surpassed 
the precision obtained with the dilepton events (see 
Section 7.2). 

4. Monte Carlo modeling of signal and 
backgrounds 



D0, L=5.3 fb"' 



-^Data 

■ Other 

■ W+jets 

■ Multijets 




1 >2 

Number of b-tagged jets 

Figure 13. Number of candidate tt events vs. 
the number of tagged jets in the £+]ets topology 
for events with at least four energetic jets [68]. 

QCD multijet production is the major background 
in the all-hadronic topology. Without 6-tagging 
requirements, this background is three orders of 
magnitude larger than the signal. The sample signal 
fraction is significantly improved by demanding that 
at least two jets are tagged as a 6 jet. Additional 
kinematic requirements, e.g., a neural network selection 



To perform a precision measurement, it is important to 
have the most complete understanding of the physics 
under study as well as the best possible description of 
the detector response formalized through its computer 
simulation. The CDF and D0 collaborations use Monte 
Carlo generator programs to model a number of physics 
processes relevant to tt production and decay: the 
hard scattering of the incoming partons, multiple parton 
interactions, underlying event arising from the remnants 
of the pp system, color reconnection of final state partons, 
multiple interactions which happen in the same bunch- 
crossing, parton shower, and subsequent hadronization. 
Events are subsequently passed through a complete 
detector response simulation. The resulting simulated 
samples are treated just like the recorded pp collision 
data, using the same reconstruction software and particle 
identification algorithms. 

A variety of Monte Carlo generators are used 
to model the signal and the background and thereby 
to estimate the event selection efficiencies, kinematic 
distributions, etc. Among the generators that include 
all processes mentioned earlier are PYTHIA [17] and 
HERWIG [69]. These are leading order tree- level 
generators of 2 2 parton processes. The parton 
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Table 1. S/B for different channels. All samples have a nominal of 175 GeV/c^. 



Sample 


Dilepton 


£-|-jcts 


All-hadronic 




(e,M) 


(e,M) 


■\TM 1 J.* 

IN IN selection 








/o 


Signature 


2£+2j/+2 jets 


U+liy+4: jets 


6 jets 


Major Back. 


DY, W/Z+ieXs 


QCD, VF/Z+jets 


QCD multijets 


0-6-tags S/B 


1/1 


1/4 


1/20 


1-5-tags S/B 


4/1 


4/1 


1/5 


2-6-tags S/B 


20/1 


20/1 


1/1 


Events in 1 fb~^ 


25 


180 


150 (2 b-tags) 


( > 1 b-tag) 









shower is modeled using DGLAP (Dokshitzer-Gribov- 
Lipatov-Altarelli-Parisi) evolution [70, 71, 72], while the 
hadronization process is simulated using a string model 
in PYTHIA and a cluster model in HERWIG. Decays of 
tau leptons are simulated using the TAUOL A library [73] . 
Decays of hadrons containing c and b quarks are handled 
with the EVTGEN package [74]. Other generators, 
such as ALPGEN [75] and MadEvcnt [76], generate only 
the hard scattering reaction and rely on PYTHIA or 
HERWIG for simulation of other processes. 

4.I. Modeling of fl events 

The CDF collaboration employs PYTHIA v6. 2 as its 
default Monte Carlo generator to model tt production 
and decay as well as parton fragmentation and 
hadronization. The D0 collaboration uses instead 
ALPGEN v2 [75] in which the probability to radiate 
additional partons is calculated according to the tree- 
level matrix elements of the relevant processes. The 
subsequent showering and hadronization is performed 
with PYTHIA which requires the use of a matching 
scheme to avoid double-counting of possible final 
states [77]. Other Monte Carlo packages, such as 
HERWIG, MadEvent [76] and MCQNLO [78], are 
used for studies of systematic uncertainties. The 
underlying event is modeled with PYTHIA, with relevant 
parameters tuned to reproduce CDF Run I and Run II 
observations. This set of parameters is commonly 
referred to as Tune A [79, 80]. PYTHIA v6.4 includes 
color reconnections models tuned to collider data [81] 
that have been employed in systematic uncertainty 
studies. 

Multiple hadron-hadron interactions within the 
same bunch-crossing are modeled with PYTHIA by the 
CDF collaboration, while the D0 collaboration takes 
extra interactions into account by overlaying data from 
random pp crossings on top of the MC events. CDF 
normally uses the CTEQ5L [82] set of parton distribution 
functions (PDFs), while D0 uses the CTEQ6L1 set [83]. 
Other PDFs used are CTEQ6M [83], MRST72 and 
MRST75 [84]. 

The measurements described in this Review use 
up to 5.8 fb~^ of pp collision data. This corresponds 



to about 4 X 10 tt events produced at each detector 
site. The size of this dataset allows for a thorough 
validation of the signal kinematics and acceptance 
modeling. Measurements of the inclusive ti cross 

section [24, 68, 85, 86, 87, 88, 89] indicate that applying 
an NLO-to-LO K-factor to the PYTHIA/ ALPGEN 
simulations results in a very good agreement between 
predicted and observed rates of the tt signal and 
background processes. Measurements of ti differential 
cross sections [30, 31] attest to the correctness of 
recent Monte Carlo predictions which incorporate NLO 
and approximate NNLO computations, as illustrated in 
Figure 14. Consistency between observations and Monte 
Carlo predictions has been verified for a large number of 
experimental distributions, with two examples provided 
in Figure 15. 

4-2. Modeling of the physics backgrounds 

The CDF and D0 collaborations employ several Monte 
Carlo generators to model tt backgrounds processes. 
Both collaboration use ALPGEN for simulating IV+jets 
production, with parton showering and hadronization 
performed by PYTHIA. The event samples are generated 
with the renormalization and factorization scale, Q^, 
set to -f J^partonsPT- ^he Overall normalization 

of the VF-l-jets background is obtained from data by 
subtracting other physics and instrumental backgrounds 
and the tt signal as a function of jet multiplicity. The 
H^+jets background is subdivided into three exclusive 
categories according to the flavor of the partons produced 
in association with the W boson: (i) W + bb/cc is the 
agglomeration of all final states which include the W 
boson, the bb or cc quark pair, and any number of 
additional jets; (ii) W + c consists of events with a 
W boson produced with a single charm quark and any 
number of additional jets; and (iii) events in which the W 
bosons are produced together with light flavor jets. The 
relative contributions from these three classes of events 
are determined using NLO QCD calculations based on 
the MCFM generator [91]. 

The diboson processes (VTW, WZ, ZZ) are 
simulated using PYTHIA. Single top quark s- and t- 
channel production is simulated with COMPHEP [92] 
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Figure 14. Example tests of the top quark 
pair production modeling at CDF and D0. The 
left plot shows the differential cr^f as a func- 
tion of the invariant mass of the tt system, to- 
gether with the approximate NNLO SM predic- 
tions [30, 200]. The plot on the right shows the 
ratio between data and NLO theoretical predic- 
tion of the distribution of the top quark px, to- 
gether with approximate NNLO predictions and 
expectations from several Monte Carlo genera- 
tors [31]. 



for D0 measurements, and with IMadEvent [76] for 
CDF measurements. The contributions from these 
background sources are normaUzed to the corresponding 
NLO predictions. 

QCD multijet production is the major background to 
ti — >■ (7i92^'Z394^- The production of six or more partons 
is poorly understood at the theoretical level. In addition, 
the large multijet production cross section combined with 
the powerful background rejection needed to isolate the 
all-hadronic tt signal translates into a computationally 
prohibitive demand to generate an enormous number of 
events. For the above reasons, the multijet background 
is modeled using data in a region depleted of ti signal. 
Application of 6-tagging techniques is necessary in the all- 





5.6 fb"'ciata 




tf signal (172.5 GeV/c^) 




W+heavy flavor 




W+light mistag 




single top/diboson 




non-W QCD 




K-S CL = 0.29 



^0 40 60 80 100 120 140 160 180 200 
MET (GeV) 



> 
(3 



c 

lU 



D0, L=5.3 fb "" 




50 100 150 200 250 300 350 400 450 500 

H; (GeV) 

Figure 15. Distribution of the (left plot) 
modeled in the CDF top mass measurement [90] 
and Hip (right plot) in top-antitop events in the 
i+]ets final state in the D0 top cross section 
measurement. The H!p variable is defined as 
the scalar sum of the transverse energies of the 
jets and of the charged lepton [68]. 



hadronic channel to achieve a reasonable S/B ratio. CDF 
uses collisions with at least six jets in the final state to 
model the QCD background, and corrects for the bias 
introduced by 6-tagging using a parametrization of the 
6-tag rate derived with an independent dataset devoid of 
QCD events. The validity of this model is tested in large- 
statistics control samples deprived of top-antitop events. 

5. Data analysis techniques 

In this section we utilize the following notation. The 
symbol x is used to denote the collection of parton- 
level kinematic quantities needed in order to completely 
define an inelastic hard scattering reaction at the leading 
order perturbation theory {i.e., to specify a point in 
the reaction phase space). In all data analysis methods 
described, it is always assumed that points x in different 
collisions are independent and identically distributed 
(i.i.d.). The symbol y refers to one or more "observed" 
quantities inferred from detector data on event-by-event 
basis. Both the probabilistic nature of x and the 
nondeterministic detector response contribute to the 
randomness of y. Even though we call y an "observed" 
quantity, it is usually a product of a sophisticated 
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event reconstruction procedure which involves pattern 
recognition, tracking, clustering of jets, kinematic fitting, 
etc. It is commonly assumed that values of y arc 
also i.i.d., and efforts are made to ensure effective i.i.d. 
behavior if this is not the case|. 

5.1. Major issues for different ti topologies 

In the process of analyzing collision data for the 
purpose of measuring Mt, several important decisions 
have to be made about statistical and computational 
techniques applied, signal and background modeling, 
measurement calibration, trade-off between expected 
statistical and systcmatics uncertainties, etc. Different 
techniques adopted by different authors resulted in a 
variety of substantially distinct data analysis approaches, 
with both complementary and competing features. In 
each approach, the following major issues have to be 
considered and addressed in a consistent manner: 

— Choice of the ti final state. The traditional 
taxonomy of dilcpton, ^+jcts and all-hadronic 
final states dictated by distinct event kinematics 
and different levels of background contamination 
is maintained by most authors, although joint 
measurements which combine events with different 
final states started to appear in the literature [93, 94, 
65]. The main advantage of such joint measurements 
is the ability to apply the detector jet energy scale 
calibration obtained in the final states with hadronic 
W decays to the dilepton channel. 

— Event sample selection. Less strict event selection 
criteria can result in improved statistical uncertainty 
at the cost of increased background fraction and 
more complicated calibration. 

— Degree to which the method depends on the 
calibration of the detector jet energy scale — the 

largest source of systematic uncertainty in the most 
precise CDF and D0 measurements of M^. 

— Level of detail in the statistical model of detector 
response to jets (for methods which do rely on jet 

energy reconstruction) . 

— Mapping of the objects reconstructed in the detector 
to the leading order parton-level entities. The 
difficulty here is due to the fact that the charge 
and the flavor of jets produced in top decays and 
associated processes [e.g., initial and final state QCD 
radiation) can not be measured with certainty. This 
leads not only to the ambiguity of assigning jets 
to the tt decay products but also to the problem 
of choosing a correct set of jets when the number 
of jets observed exceeds the number of strongly 
interacting partons produced in the leading order 
perturbation theory. In the £+jets and dilepton 
channels, additional ambiguities arise when multiple 

X For example, both CDF and D0 apply jet energy corrections 
which depend on the number of primary vertices in the event (and, 
indirectly, on instantaneous luminosity). 



solutions of kinematic equations for the u momenta 
exist. In the top mass measurement literature, 

multiple acceptable mappings between partons and 
detected objects are called "permutations" . 

The feasibility of a simple kinematical analysis of the 
pp ^ tt ^ W^bW~h process can be determined 
using the following considerations. Charged Icptons 
in decays of W bosons are produced in association 
with neutrinos. Lepton presence, while allowing for 
powerful discrimination of QCD background, leads to 
complications in the reconstruction of event kinematics 
due to undetected neutrinos. We assume for the 
moment that the four-momenta of all charged final 
state partons are reasonably well measured, that masses 
of initial state partons and final state neutrinos are 
zero, that the transverse momentum of the incoming 
partons is zero, and that all of the event transverse 
missing energy is taken away by escaping neutrinos. 
With these assumptions, the neutrino 3-momenta and 
the longitudinal components of the momentum of the 
incoming partons are the only kinematical quantities not 
known, and then the total number of additional kinematic 
constraints needed in order to completely specify the 
reaction is 2 + Sn^, where is the number of neutrinos 
in the final state. Four of these constraints come from 
the energy-momentum conservation, and three additional 
constraints can be introduced by using the narrow width 
approximation for top and W: the invariant masses of 
the W decay products are required to be consistent with 
the mass of the W and the invariant masses of top 
decay products are set to Mt. Mt itself is considered 
unknown, so the latter requirement generates only one 
effective constraint for the two top quarks present in the 
process. With these seven constraints, the total number 
of kinematical degrees of freedom (DoF) is 3n^ — 5. 
This gives 1 DoF for the dilepton final state, —2 for 
£+jets (the system is ovcrconstraincd) , and —5 for the all- 
hadronic channel. This indicates that kinematic fitting 
techniques based on minimization can be utilized 
for the analysis of both (.+]v±s and all-hadronic final 
states. In the dilepton channel kinematical equations are 
underconstrained, so this channel has to be treated in a 
substantially different manner. Kinematical analysis of tt 
events in different final states is described in more detail 
in Section 5.3. 

From the statistical point of view, the result of 
any Mt measurement is an interval estimate of this 
parameter. Most authors utilize frequentist confidence 
intervals [95] by numerically studying the distribution of 
the point estimator with Monte Carlo simulations. Use 
of Bayesian credible intervals for top mass estimation 
has also appeared in the literature [96, 97, 98, 99]. 
Techniques used for constructing the point estimator 
itself are described in detail in the remainder of this 
section. 
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5.2. Methods based on distribution fitting 

An unbiased point estimator of can be obtained 
by fitting a distribution of some observed quantity y 
to a sum of signal and background contributions. In 
principle, any quantity whose distribution depends on the 
parameter of interest can be used to build an estimate 
of that parameter. In practice, one strives to choose y 
satisfying the following conditions: 

— The sensitivity of the combined (signal plus 
background) distribution of y to Mt is high. A 
quantitative measure of such a sensitivity can be 
provided, for example, by Fisher information [95]. 
All other factors being equal, a measurement which 
uses a distribution with higher sensitivity to Mt will 
have lower statistical uncertainty. 

— Modification of distribution nuisance parameters, 
such as detector calibration constants and sample 
signal fraction, does not lead to a noticeable change 
of the M^ estimate {i.e., off-diagonal elements of the 
Fisher information matrix which involve Mt should 
be small) §. 

It can be easily appreciated that, in the case of Mt 
measurements at CDF and D0, these conditions are 
contradictory. For example, an Mt estimate with 
high sensitivity can be obtained by calculating the 
invariant mass of three jets produced in the decay 
t — > Wb — > jjb (assuming for the moment that the 
jets can be chosen correctly). This invariant mass 
is proportional to the detector jet energy scale with 
coefficient of proportionality close to unity. As discussed 
in Section 3.3, a few percent relative uncertainty 
is typical for standard jet energy scale calibrations. 
This uncertainty immediately translates into a strong 
limitation on the precision of the Mt determination in 
such a measurement. 

For any particular Mt measurement technique, 
the relative contributions of statistical and systematic 
uncertainties vary with the amount of available data. 
To first order, the statistical uncertainty is expected to 

scale in proportion to 1/ J Cdt. Due to improvements 
in detector calibration and Monte Carlo simulations, the 
systematic uncertainty also decreases as / Cdt increases, 
albeit at a slower rate. Combined with the rich set of 
possible y choices, this variability resulted, over time, in 
a large number of studies which use distribution fitting 
for the Mt determination. Both signal and background 
distributions of y are constructed from Monte Carlo 
samples, for a set of discrete values of Mt. Recent 
works also include the jet energy scale as the second 
parameter, so that the set of distributions is initially 
defined on a 2-d parameter grid. These Monte Carlo- 
derived distributions are called "templates", and the 

§ It is always assumed that point estimators of M^ are properly 
calibrated and unbiased and that the presence of uncertain nuisance 
parameters is the sole reason for the systematic uncertainty of the 
measurement. 
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Figure 16. Reconstructed mt distributions 
from simulated pp — > tt events, for several 
different Mt values. Events in the samples used 
to build these templates have exactly one b- 
taggcd jet. The overlayed curves are continuous 
template parameterizations [114]. 

whole technique of distribution fitting is referred to as 

"the template method" in the top mass measurement 
literature. The following y quantities have been utilized: 

— Event-by-event top mass estimates m^. Due to its 
relative simplicity and adequate sensitivity, this is 
by far the most popular approach [100, 101, 102, 
103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 87, 
113]. Kinematical techniques used to obtain these 
estimates are presented in Section 5.3. An example 
set of mt templates is shown in Figure 16, taken 
from [114]. 

The transverse mass mT2 [115]- This kincirnatic 
variable approximates rrit in a manner appropriate 
for use with two missing neutrinos in the dilepton 
final state. 

— Combination of mt and W mass [116, 114, 117, 93]. 
The W mass is included in order to decrease the 
sensitivity to changes in the jet energy scale. Two 
mt values (which correspond to the best and second 
best jet-to-parton assignment) are used in [65, 118] 
which results in a 3-dimensional template. 

mt and the scalar sum of the pr of the four leading 
jets [119]. 

— mt and Ht [93]. 

— mt and mT2 [65]. 

— mt and its estimated resolution, a [112, 66]. 

Invariant mass of the charged Icpton and the b quark 
or energy of the two highest Et jets (early dilepton 
channel measurement [120]). 

Several template techniques have been developed with 
the explicit goal of minimizing the dependence of the 
Mt estimate on the calorimeter jet energy scale. These 
techniques rely on the tracking information alone which 
eliminates the jet-related systematic uncertainty but also 
results in a substantial degradation of the measurement 
sensitivity. The following quantities have been employed 
for this purpose: 
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Figure 17. The transverse decay length of 
6-tagged jets (left) and the transverse lepton 
momentum (right) for two different values of 
Mt separated by 50 GeV/c^ [122]. 

— The transverse decay length of 6-tagged jets, 
Lxy [121, 122]. Average Lxy is increased with Mt 
because h jets receive stronger transverse boost from 
heavier top quarks. 

— The transverse momentum of leptons (e and ^) from 
W decays [123, 122, 124, 125]. Example distributions 
of this and the previous quantity are shown in 
Figure 17. 

— The invariant mass of the charged lepton from the 
W decay and the muon from the semileptonic decay 
of the h quark [126]. 

A number of measurements of the top quark mass in the 
dilepton channel by the D0 collaboration [127, 128, 110] 
utilized a unique template approach which does not quite 
fit in any of the categories already described. In this 
approach, weights are assigned to all mt values in each 
event according to the z^WT algorithm (see Section 5.3 
for the description of this procedure) . In this algorithm, 
the tt production and decay processes are not modeled 
in sufficient detail and background contamination is 
not accounted for, so in practice the weights can not 
be treated as mt likelihoods. Instead, a method has 
been developed in which the probability of the complete 
mt weight distribution is estimated for each event as 
a function of Mt. For the purpose of dimensionality 
reduction, the normalized weight curve is split into Nb 
bins, weights are integrated in each bin, and the (iVt, — 1)- 



dimensional vector of integrated weights is used as y 
(due to the imposed normalization, N}, — 1 weights are 
independent). According to [127], this approach improves 
the statistical sensitivity of the measurement by w25% in 
comparison with mt templates. 

For any choice of y, initial template construction 
is performed by nonparametric density estimation 
techniques. Simple histogramming is sufficient in 
case y is one-dimensional, while multivariate templates 
are usually built by kernel density estimation [129]. 
Fitting of the template distributions to the observed 
data is performed by the maximum likelihood method, 
with MINUIT [130] being the common choice of the 
optimization and error analysis engine. To ensure 
proper fit convergence, it is highly desirable for the 
likelihood to have at least two continuous derivatives 
as a function of each parameter. However, Monte 
Carlo-derived templates are only defined for a discrete 
parameter set. Two distinct solutions to this problem 
have been identified: 

— A continuous parametric statistical model is fitted 
to the templates so that they become continuous 
functions, together with their first few derivatives, 
of both their parameters and y. In this approach, 
the likelihood for the data can be determined for 
arbitrary parameter values, and it is continuous. 

— The log-likelihood is initially determined for the 
parameter values for which templates were defined. 
Then the log-likelihood itself is interpolated to 
other parameter values using, for example, local 
polynomial regression [131], or by simply fitting the 
log-likelihood to a second or third degree polynomial 
near the peak. The smoothness assumptions are thus 
introduced directly into the likelihood curve. 

The second solution is also widely employed by the 
phase space integration methods described in Section 5.4. 
There, direct calculation of the likelihood for arbitrary 
parameter values becomes too expensive CPU-wise, so 
one has to resort to interpolation. 

Let's assume that a continuous representation of 
the signal and background templates, ^(yjMt,^) and 
i3(y]Mt,0), respectively, is available. Here, 6 represents 
nuisance parameters of the measurement other than the 
sample signal fraction (typically, jet energy scale). Then 
a simple likelihood for the observed sample can be written 
as 

L(A/t, 0, /.) - nf=i [/. '5(y,lMt, 9) , . 

+ (l-/,)B(y,lMt,0)], ^' 

where j = 1,2,...,N is the event number in the sample 
and fs is the sample signal fraction. The subsequent 
elimination of the nuisance parameters and fs is 
performed either by profiling or by marginalization. 
The profiling procedure consists in maximizing the 
likelihood (1) with respect to 9 and fs for each 
value of Mt. Marginal likelihood is instead calculated 
by introducing a prior distribution for the nuisance 
parameters (such a prior distribution usually takes into 
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account uncertainties of existing estimates of these 
parameters) and integrating the hkeUhood over 9 and fs 
with the prior. Mixed treatments have been utihzcd as 
well, in which the likelihood is initially profiled over 
and then 9 is marginalized. Relative merits of likelihood 
profiling and marginalization are discussed, for example, 
in [132]. 

Various modifications of the basic likelihood (1) 

have been employed in a mimber of Mt measurements. 
These modifications take into account uncertainties in 
the S{y\Mt,9) and B{y\Mt,9) shapes due to the limited 
number of Monte Carlo events available [103, 107] as 
well as binning effects in case histogramming is used to 
build the templates [104, 133]. In addition, the observed 
event sample is often split into several non-overlapping 
subgroups (which differ, e.g., by the number of 6- 
tagged jets) with different template shapes and expected 
background fractions. This allows for more precise 
template modeling in each subgroup as well as for the 
introduction of separate /„ priors. After elimination of 
the nuisance parameters by profiling or marginalization, 
the Mt log-likelihoods from different subgroups are added 
together to obtain the final result. 

The "ideogram" technique is a modification of the 
template method which attempts to take into account 
the resolution of the mt estimate on event-by-event 
basis [59, 134, 135, 136]. All Mt measurements which 
utilized this technique so far lacked a consistent likelihood 
formulation owing to the absence of the resolution prior, 
as discussed in detail in [137]. Perhaps, this is the reason 
why such measurements did not demonstrate convincing 
uncertainty improvements over results obtained with 
templates. 

The top quark mass can also be estimated 
indirectly from the measurement of the tt production 
cross section [138, 111, 139], assuming that the cross 
section dependence on Mt can be determined with 
sufficient precision from the Standard Model theoretical 
calculations. 

5.3. Kinematic reconstruction of the top mass 

In the £+iets and all-hadronic channels the number of 

available kinematic constraints exceeds the mimbcr of 
unknown quantities (including Mt itself), therefore these 
channels are amenable to kinematic fitting using 
minimization. The is constructed using transverse 
momenta of detected leptons and jets as well as the 
missing transverse energy: 

/ *, fit i,meas \ 2 

2 _ (Pt -Pt ) 



'.,4jets 



Ti:, — 



UEJit UB,meas\2 



-P 



(2) 



r? + r? 



This particular expression is appropriate for the £-|-jets 
channel (the notation follows [114]). Symbols with 
superscript fit denote fitted parton-level variables which 



determine leading order ti production and decay 
kinematics, while superscript meas refers to quantities 
measured in the detector. The quantities used to form 

this expression are: 

— p^rp are transverse momenta of leptons and quarks 
(fitted) or jets (measured). 

— p^^ are the transverse components of the unchis- 
tered energy. The transverse momenta of the jets, 
neutrino, and unclustered energy are related at the 

parton level hy p^ = — (^p^ + J2Pt^ +Pt^^^ as- 
suming that the initial transverse momentum of the 

colliding particles is 0. 

— M^^, Mjj are invariant masses of W decay products 
constructed using parton-level quantities. 

— Mh£^, Miijj are invariant masses of top quark decay 
products constructed using parton-level quantities. 

The for the all-hadronic channel is similar, with light 
jets used everywhere instead of £ and u, the first sum 
running over 6 jets, and the p^^ terms omitted [87, 105]. 

In (2), Gaussian resolution functions are employed 
for magnitudes of all transverse momenta, missing energy, 
as well as for the mass constraints. The method does 
not take into account angular resolutions of lepton 
and jet directions (these directions are assumed to be 
perfectly measured), asymmetry of jet p^ resolution 
functions, or expected event population in the reaction 
phase space. The is minimized with respect to 
all parton-level kinematic qiiantitics (magnitudes of the 
transverse momenta of h and light quarks, 3-momentum 
of the neutrino) as well as m^. This minimization is 
performed separately for all jet-to-parton assignments 
(permutations) consistent with the observed set of b 
tags. Typically, two solutions are obtained in the f-|-jets 
channel when kinematic equations are solved for the 
t — )■ Wb — >■ tuf^b decay sequence, thus increasing the 
number of effective permutations by a factor of two. The 
value of mt at the x^ minimum over all permutations 
and parton-level kinematic variables is usually taken as 
the top mass estimate for the event. Distributions of the 
X^ minimum values are illustrated in Figure 18 for the 
^-|-jets channel. 

Several modifications of this basic approach were ex- 
plored, typically resulting in a more complicated method 
with only a slight improvement in the measurement sen- 
sitivity: 

— Use of mt estimates from more than one permuta- 
tion [104, 140, 65]. 

— Inclusion of the jet angular resolution terms [101, 
104, 103, 119]. 

— Use of a separate jet energy scale factor for each 

permutation [119]. 

— Addition of a term which models the b tagging 
probability [103]. 

Replacement of Gaussian mass constraints with 
Breit-Wigner constraints [104]. 
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Figure 18. Minimum x 
events witti at least one b 
5.6 fb"-'^ event sample [65]. 
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Another possible modification that has not been tested 
yet in the context of Mt measurements consists in 
replacing the mass constraints in (2) with the term 
—2\n[p{-x\mt) Ptag], where p(x|Mt) represents the 
phase space density of the process normalized by the 

— — is the Jacobian 



total observable cross section, J ■ 

which relates the fitted quantities as well as the quantities 
which are assumed to be perfectly measured to the phase 
space variables, and ptag is the permutation-specific 
probability of the observed h tag configuration (see 
Section 5.4). The goal is to find the most probable values 
of all fitted quantities taking into account theoretical 
assumptions about the reaction and thereby to profile 
the rut likelihood over the complete parton-level phase 
space] I . 

For the dilepton channel, the number of constraints 
is insufficient to determine mt- It would be possible to 
infer mt if at least one additional kinematic quantity 
or constraint was available. This suggests the following 
method of mt determination: values of some unobserved 
kinematic quantities ^ are assumed, kinematic equations 
are solved, and allowed values of top mass, ?7i((^,y), are 
determined (in addition to ^, m((^,y) depends on some 
or all quantities y measured in the detector). ^ values 
are scanned within their kinematic limits. Obtained 
mt(^,y) values are assigned weights proportional to 
the probability density, p(C), of some other kinematic 
quantities C,{£,^y) (variable sets ^ and C may overlap 
partially, completely, or not at all). p{C,) is determined 
in advance, either from theoretical considerations or by 
estimating this density numerically using large samples 
of Monte Carlo events. While it is not strictly necessary, 
this method operates in the most transparent manner 
if p{C,) has little or no dependence on Mt. The 
distribution of possible mt values, p{mt), is thus formed, 
proportional to / p(C(Ci y)) <5(™t(?, y) ~ mt)d£,. The 
location parameter of this distribution (usually the mode) 
is used as the top mass estimate for this particular event. 
This approach is very general, and a judicious choice of 

II A very similar approach, albeit without proper normalization, is 
discussed in [141], 
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Figure 19. Weights for the individual events 
in the D0 Run I dilepton I'WT analysis as a 
function of mt [128]. 



^ and C variables will produce an estimate with good 
sensitivity to while maintaining a relatively simple 
statistical model for p(C) (inclusion of all parton-level 
phase space variables in both ^ and ^ leads to techniques 
described in Section 5.4). 

The following choices of ^ and ^ have been explored: 

' S, = (ri^,r]p): a two-dimensional variable, where 
rji, and rjp are pseudorapidities of the neutrinos. 
C = (Vi^tVpt Pt)i where is the two-dimensional 
missing transverse energy. This method is called 
neutrino weighting algorithm (i^WT) [128, 102, 107, 
110, 112, 66, 65, 142]. Weight distributions for the 
six dilepton events from [128] are shown in Figure 19. 

• f = (v'l^j'y'p), where ip^, and (^p are neutrino 
azimuthal angles in the plane transverse to the beam. 
A variable formed in a manner similar to (2) 
is utilized as C (since values of two variables are 
assumed, the number of constraints exceeds the 
number of unknowns by 1, so it becomes possible to 
construct a meaningful x^)- Weights proportional 
to are assigned to mt(^,y) solutions. This 
method is referred to as the neutrino ip weighting 
algorithm (PHI) [107, 108]. 

• f = {mt,Pi+,Pi-,Pb,Pb), C = X, where x 
represents all leading order parton-level variables 
which determine the reaction kinematics. Directions 
of the charged leptons and b quarks are assumed 
to be perfectly measured, while magnitudes of their 
momenta (inverse magnitude in the case of /i) are 
varied within their expected Gaussian resolutions. 
Neutrino momenta are calculated from kinematic 
constraints in the narrow width approximation for 
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t and W and zero transverse momentum assumption 
for the tt system. Values of mj = are weighted 
by the phase space density of the process at x, 
summed over all possible kinematical solutions for 
the neutrinos and jet-to-quark assignments. This 
approach is known as the matrix element weighting 
algorithm (7UWT) [143, 128, 110, 112]. It is a direct 
precursor of the phase space integration methods 
described in Section 5.4. 

• C = C = {pl\Eb,Ei, fJj.), where p** _is the z 
component (along the beam) of the tt system, 
while Eb and E^ are the energies of the b 
and b quarks, respectively. The probability 
density of C is factorized into the product of 
probabilities for individual components, with all 
marginals represented by Gaussian distributions and 
correlations neglected. This method is known as the 
full kinematic analysis (KIN) [107]. 

It appears that the Mt sensitivity of the AiWT and 
vWT algorithms is similar [112] and slightly exceeds 
that of both PHI and KIN methods [107]. In order to 
increase the overall sensitivity, results obtained in the 
dilepton channel on the same data with multiple template 
methods are often combined using the "best linear 
unbiased estimator" (BLUE) approach [144, 145]. In this 
case correlation coefficients between different methods 
are determined using common pseudo-experiments. 

5.4- Phase space integration m,ethods 

The technique of calculating event observation probabil- 
ities by integrating over all parton-level quantities in the 
reaction phase space; was introduced into the high energy 
physics data analysis practice in the pioneering Run I 
mcasiircmcnt of the top quark mass by the D0 collab- 
oration [96]. This method can be understood as an ap- 
plication of the Bayesian principle of integrating over all 
unobserved degrees of freedom with a well-motivated in- 
formative prior provided by the Standard Model theory 
{i.e., event-by-event marginalization of x). The prior is 
proportional to the matrix element squared of the pro- 
cess, so the technique is often referred to as the "matrix 
element method" (MEM). The probability of observing 
quantities y in a particle detector, Pev(y|a), is deter- 
mined according to 



Pev(y|a) = 5^/iPi(y|a), 



(3) 



where a is the set of model parameters. This set includes 
Mt and can also include detector jet energy scale as 
well as other theoretical and instrumental quantities, fi 
are the fractions of different non-interfering production 
channels consistent with y and constrained by fi = '^. 
In the context of Mt measurements, the probability to 
measure y in each channel i is most often estimated from 

X W,(y|x, a) |M,(x, a)|2 Ti{^, a) rfx, ^ ' 



P^{y\^) 



where the following notation is utilized: 

n(y) — Indicator function for the analysis accep- 
tance (1 for events which pass the event selection criteria, 
otherwise) . This term can be replaced by 1 in case only 
accepted events are considered. 

X — Variables which uniquely specify a point in the 
channel phase space x G 

dx — Differential element of the phase space 

cri(a) — Channel cross section: 



C7i(a)= / |Mi(x,a)|2Ti(x,a)dx. 



Ai{a) - Overall experimental acceptance for 
channel i. 

Wi(y|x, a) — Detector transfer function (TF). This 
is the probability density for observing detector response 
y from the space of possible measurements Y when 
the "true" phase space coordinate of the event is x. 
This function should be normalized for every value 
of «, X, and a either by iyi(y|x, a) c?y = 1 or 
by jyD.{y)Wi{y\x.,a)dy = ei(x,a), where ej(x,a) is 
the efficiency to detect an event with phase space 
coordinate x. The difference between these TF 
normalization conditions and their effect on TF modeling 
is discussed in [146]. 

|Mi(x, a)p — Squared matrix element of the 
process. 

Ti(x,a) — Other factors which do not depend on 
y {e.g., flux of colliding beams, parton distribution 
functions). 

In a compact symbolic notation, (4) can be 
represented as 

1 (^o-i,o6s(y]a) 



Pi{y\^) 



<:^i,obs{a) 



dy 



(5) 



where '''^''"^y ^^'^^ is the differential observable cross 
section for channel i. Parameter set a is estimated 
by maximizing the likelihood which is constructed by 
multiplying probabilities for all events in the sample: 

The denominator term 
d.,..UyM^y (total 



m = n,tii^ev(y,|a). 
cri,o6s(a) = ai{a)Ai{a) = Jy 
observable cross section) ensures proper likelihood 
normalization. 

The MEM statistical model of a particle physics 
process is significantly more precise than the models 
employed by the distribution fitting methods, as it 
takes into account the process phase space density. 
Detailed shapes of detector transfer functions can be 
utilized, while techniques based on minimization 
essentially assume Gaussian resolutions. These and 
other advantages of MEM [146] result in an improved 
statistical precision of the parameter estimates. On 
the other hand, efficient calculation of the phase space 
integral (4) is nontrivial, as it can not be performed with 
standard phase space sampling schemes developed for 
Monte Carlo event generation because of the presence of 
transfer functions which alter the peak structure of the 
integrand. This situation resulted in a development of 
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a number of ad hoc dimensionality reduction and phase 
space sampling schemes for calculating (4) in the context 
of the ti production and decay. A recent study [147] 
attempts to address this problem in a systematic manner 
for a number of processes. 

All Mt measurements performed so far integrate (4) 
over the leading order parton phase space, so that the 
final state "soft QCD" processes (parton showering and 
hadronization) arc combined together with the detector 
response, and this combination is subsequently modeled 
empirically by the detector transfer function. For each 
channel, the transfer fimction of the process is factorized 
into a product of transfer functions for K individual 
partons traced in the detector (neutrinos are excluded): 

K 

W(y|x,a)= ^ J^PmukWkiymAy^k^a), (6) 

7r[mi,...,mjf] k=l 

where Xfe is the phase space coordinate of the k^^ 
parton, ^'^i'*^- the measured quantities for the "physics 
object" with index ruk (jet, lepton candidate) obtained 
in the event reconstruction and pattern recognition 
process, Pmkk are the prior probabilities for associating 
parton k with physics object mfe, and the channel 
index i is omitted for brevity. The sum is performed 
over K\ possible assignments of indices [m\, ...,mK\ to 
permutations of [1, i^T]. For ti signal modeling, it can 
be assumed that leptons are identified unambiguously 
{i.e.. Prn,,k is reduced to Kroneckcr 5qk in case the lepton 
has the index q in the list of physics objects), but jet 
charge and flavor can not be determined with complete 
certainty. A number of measurements [148, 149, 98, 99] 
utilized the following Pmuk assignment scheme: 

Pnikk = Ptag(y) if k refers to a 6 or 6 quark in the 
final state and ruk refers to a 6-taggcd jet reconstructed 
in the detector. Ptag(y) is the probability to flavor tag 
a 6 jet as a function of its reconstructed momentum and 
other measured quantities (such as the number of tracks). 

Prukk = 1 — Ptag(y) if k refers to a 6 or 6 quark in 
the flnal state and mj, refers to an untagged jet. 

Prrikk = Pmistag(y, /) if k rcfcrs to a light (/ = u, d, 
or s) or charm (/ = c) quark in the final state and rUk 
refers to a 6-tagged jet. Pmistag(y, /) is the probability to 
mistag a jet originating from a quark with flavor f ^ b 
as a 6 jet. 

Pmkk = 1-Pmistag(y, /) if k refers to a light or charm 
quark in the final state and rrifc refers to an untagged jet. 

Although in practice the priors Pm^k are derived as 
functions of ym^, their dependence on y is usually mild 
and can be neglected in comparison with the fast varia- 
tion of Wk (y I x;j , a) . It is also commonly assumed that the 
overlap between phase space regions which contribute to 
different permutations is negligible. With these approx- 
imations, the transfer functions for individual partons 
can be normalized either by /y^ W/c(yfc|xfc, a) rfyfc = 1 
or by /y^ fifc(yfe)Wfc(yfc|xfc,a)dyfe = efc(xfc,a). Both of 
these normalization conditions assume (either implicitly 
or explicitly) that the factorization model can also be ap- 
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Figure 20. Jet transfer functions for ligtit 
quark jets, \r)\ < 0.5, for parton energies Ep = 
30 GeV (solid curve), 60 GeV (daslied curve), 
and 90 GeV (dash-dotted curve). Perfect 
angular resolution is assumed [97]. 

plied to acceptance and efficiency. An example jet energy 
transfer function is shown in Figure 20. 

Appropriate normalization for Pm^k depends, in 
general, on the precise meaning of these coefficients. 
For the flavor tagging scheme outlined above, it 
becomes J^Piag = 1, where ptag = Ilk=iP'nkk is the 
permutation-specific probability of a certain heavy fiavor 
tag assignment and the sum is performed over all 2^ 
possible assignments. Note the absence of the sum 
over permutations, contrary to the convention tised by 
some authors [97]. Different permutations correspond 
to different phase space points, and the overall transfer 
function has to be normalized for each phase space point 
separately. 

On top of the product model (6) for the detector 
transfer function, additional correction terms were 
utilized in a number of Mt measurements: 

— Simplifying assumptions about jet transfer functions 
{e.g., perfect angular resolutions and independence 
of jet response from proximity to other jets) 
were partially compensated for either by adjusting 
the process phase space density (the "effective 
propagators" approach of [148]) or by introducing 
transfer functions which model smearing of the 
angular distances between jets [150]. 

— Presence of the initial state QCD radiation was 
modeled either with a prior on the transverse 
momentum of the ti system [148, 149, 151, 117] or 
with an explicit transfer function which takes into 
account additional energy visible in the detector and 
not associated with the K most energetic physics 
objects expected at the leading order [152]. 

Out of the 32 variables needed to specify the 
phase space point for the — > — > 6 partons 
reaction, only the masses of initial partons and outgoing 
charged leptons and neutrinos can be considered exactly 
known. Taking into account the energy-momentum 
conservation, this leads to 22, 24, and 26-dimensional 
phase space integrals in the dilepton, i+jets, and all- 
hadronic channels, respectively. In case the integral (4) 
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is evaluated by Monte Carlo integration, the relative 
precision of the result is 5rei = (f// {!)) , where 

(/) and (7/ arc, respectively, the average value and 
the standard deviation of the integrand, and N is the 
number of integrand evaluations. I is sharply peaked 
over small regions of phase space which leads to a 
large prefactor ai/{I). For example, for the W mass 
Breit-Wigner distribution integrated between and Mt, 
(7// (/) ss 3.5, so that reaching relative precision of 10~^ 
requires ~ 10^ integrand evaluations. EfRcient Monte 
Carlo calculation of such integrals can be performed 
by applying convergence acceleration techniques, such 
as importance sampling and stratified sampling. These 
techniques, however, only work well if the peak structure 
of the integrand is factorizablc in terms of integration 
variables. In (4), the peaks are not aligned with the 
momenta of final state particles in terms of which the 
phase space integration is formally defined. Instead, the 
transfer functions are concentrated around the observed 
directions of leptons and jets, while propagators in the 
matrix clement are strongly peaked around t and W 
masses. Thus a change to a new, efficient variable set 
is necessary. 

Another technique commonly used to make the pref- 
actor smaller is dimensionality reduction: a constraint is 
imposed on the integration variables, and the integrand 
is evaluated only for the subspace defined by that con- 
straint^. A number of assumptions have been introduced 
by different authors for this purpose, in various combina- 
tions: 

(i) Assume that some or all of the quarks are on shell 
{i.e., masses of the light and b jets are fixed). This 
assumption removes Ug integration variables, where 
Uq is the number of quarks in the final state. 

(ii) Ignore individual transverse momenta of the incom- 
ing partons and consider only the transverse momen- 
tum of the tt system (—2 integration variables). 

(iii) Assume that tt transverse momentum is (this 
assumption is stronger than the previous one, —4 
variables). 

(iv) Assume that jet directions are perfectly measiired in 
the detector. In this case, jet transfer functions are 
proportional to S'^{fl — 0,meas) (—2 rig variables). 

(v) Assume that charged lepton momenta are perfectly 
measured (— variables). 

(vi) Use narrow width approximations for t (—2 
variables) and/or W (also —2). 

In principle, any such assumption reduces the fidelity 
of the statistical model, but practical considerations 
(in particular, the need to evaluate a large number of 
integrals for a set of parameter values using limited CPU 
resources) often take priority. For example, in the original 

1 Prefactor dependence on the integral dimensionality, D, is 
determined by the exact nature of the integrand and the chosen 
subspace. In the mildest case, when the peaJfs are fully factorized, 
prefactor behaves as 0{D). 



MEM Mt measurement by D0 in the £-|-jets channel [96] , 
(i), (iii), (iv), and (v) were assumed which leads to a 5- 
d integral. Masses of W and top resonances, as well as 
the momentum magnitude for one of the partons in the 
hadronic W decay were chosen as integration variables. 
This resulted in a reasonably well factorized phase space 
peak structure, so the integrals could be subsequently 
evaluated automatically by the VEGAS [153] adaptive 
numerical integration program. In a recent £-|~jets 
Mt measurement by CDF [149], only (ii) and (v) 
were assumed which allowed the authors to employ 
detailed transfer function models and resulted in a 19-d 
integral. The convergence was accelerated by appropriate 
variable transformations (importance sampling), by early 
pruning of jet permutations with small contributions 
into the overall integral, and by use of low-discrepancy 
sequences [154] for points in which the integrand was 
evaluated (Quasi-Monte Carlo). 

The Dynamical Likelihood Method (DLM) [155, 156] 
is a phase space integration technique which evaluates (4) 
in a particular manner. Note that the integral can be 
formally written as 

my\^) = ^ugfc) (|M.(x,a)|2r.(x,a))^^ 
x^^T4^,(y|x,a)dx, 

where stands for averaging over points x e $i 

distributed in the phase space with density Gi(x|y,a) = 
vt^,(y|x,a) — Following the naming convention 

M',(y|x,a)dx ^ 

of [156], we refer to Gj(x|y,a) as posterior transfer 
function (PTF). Phase space sampling according to PTF 
efficiently takes into account all structure present in the 
integrand due to detector resolution (peaks in the matrix 
element still require special treatment). In principle, 
PTF can be derived from Monte Carlo simulations of the 
physics process and detector, using joint distributions of 
X and y in the region defined by fi(y) = 1, where every 
event enters with the weight (|Mj(x, a)pTi(x, a)) 
In practice, however, due to large dimensionalities of 
X and y, some kind of a PTF product model must 
be employed. When the terms in the PTF product 
expansion, such as jet posterior transfer functions, are 
derived, correlations between different x dimensions 
(caused, e.g., by energy-momentum conservation and 
complicated phase space boundaries) arc ignored. During 
Monte Carlo integration, lepton and jet variables are 
sampled first, while proper kinematics is subsequently 
enforced by imposing constraints on invisible particles 
(neutrinos). In addition, the factor Wi(y]x, a)dx 
which depends on a and normalizes Pi(y|a) in (7) 
is usually neglected (i.e., assumed to be constant). 
Compounded, these approximations result in some 
degradation of the statistical model and a biased 
maximum likelihood estimator of Mt [157, 158, 159]. The 
estimator bias is subsequently removed by a calibration 
procedure (see Section 5.6). 



CONTENTS 



21 



5.5. Background handling 

In addition to optimizing event sample selection 
requirements and heavy flavor tagging, a number of 
methods have been developed for the purpose of 
strengthening the background suppression and for taking 
into account background contamination in the Mt 
estimates. Increase in the signal fraction improves 
sample sensitivity to M^. At the same time, background 
discrimination should preferably be implemented in such 
a manner that this fraction (a nuisance parameter in 
the measurement) is not correlated strongly with the Mt 
estimate. The relative importance of these arguments in 
the context of any particular Mt measurement method 
is not obvious a priori. As the results (with the notable 
exceptions of [105, 160]) are usually presented without 
describing the extent to which different background 
suppression options were explored, relative merits of 
different background handling techniques are difficult to 
compare. 

Correlation reduction between fg and Mt was 
emphasized in [101, 104]. With this purpose in mind, one 
can not rely upon the most distinguishing properties of tt 
events, the invariant masses of various jet combinations 
and the large amount of energy visible in the detector, for 
background rejection. Topological event characteristics 
have to be exploited instead, such as differences between 
angular distributions of ti decay products and particles 
produced in background processes. Even though there 
appears to be no simple kinematic variable of this kind 
with high discriminating power, optimized combinations 
of multiple variables have been used in practice with 
considerable success. For example, the "low bias 
discriminant" technique of [101, 104] developed for the 
f+jcts channel employed a likelihood ratio discriminant 
in the form of a cut on D t r (y) = — ^ , where 
Ps{y) and Pb{y) are the quasi-probability densities of 
the signal and background, respectively. The set 
of observed variables y (missing transverse energy, 
aplanarity, centrality, and one other variable which 
characterizes average angular separation between jets) 
was chosen and the shapes Ps{y) and Pb(y) were tuned 
in such a manner that the distributions of D lb (y) were 
independent from Mt. The discriminant distributions 
for simulated samples of signal and background events 
are shown in Figure 21. The DLB(y) cut was then 
optimized for the best overall statistical uncertainty while 
encumbering little correlation between and Mt. The 
use of a neural network with the same input variables was 
explored as well, with similar results. 

Neural networks have also found their use in the 
Mt measurements which do rely, at least partially, 
upon energy-dependent variables for background dis- 
crimination. According to this approach, better sig- 
nal/background separation is valued over reduced cor- 
relation between and Mt. These studies include [148, 
149] in the £-hjets channel and [106, 87, 105, 113] in the 
all-hadronic channel. 
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Figure 21. The Dlb discriminant plotted 
for the Mt = 175 GeV/c^ ti sample (hatched) 
and the simulated ba<;kground (not hatched). 
The histograms are normalized to the same 
area [104]. 

An advanced investigation of the effect of back- 
ground suppression on the precision of Mt measurement 
in the dilepton channel has been carried out in [160]. In 
this study, a neural network with six input variables was 

directly optimized for the best statistical uncertainty of 
the Mt estimator. The estimator itself was obtained by 
MEM in which the background probability was calcu- 
lated for the Z/^* -|- 2 jets and WW + 2 jets production 
as well as for jets faking leptons in Vt^ -I- 3 jets. The 
NN optimization algorithm (neuroevolution) allowed for 
modification of both NN topology and weights. As a 
result, the space of potential selection requirements was 
explored in considerable detail. Counter to prior expecta- 
tions, the best discriminator permitted a significant frac- 
tion of background events (about 65%) to remain in the 
sample. The loose selection resulted in «20% improve- 
ment in the a priori statistical uncertainty in comparison 
with the previous dilepton sample definition employed 
in [151]. 

A number of Mt measurements performed with 
MEM did not evaluate background probability explicitly. 
The event observation probability was formed using the 
signal hypothesis only, so background did not contribute 
to (3) and had to be taken into account by other 
means. It is common for such measurements to discard 
events whose peak Mt likelihood magnitude is below 
a certain cutoff. This requirement discriminates the 
signal not only against the background but also against 
ti events in which an incorrect set of jets was selected 
to form the ti system (this can happen, for example, 
due to a presence of an energetic initial state radiation 
jet). Several distinct approaches can be employed in 
order to form a consistent Mt estimate. One can 
fall back onto the template method [117], adjust the 
likelihood so that the background contribution is removed 
on average [148, 149] + , or just correct for background 
presence in the calibration procedure [157]. The template 
method can handle an arbitrary background mix and 
therefore it was used in the background-dominated all- 
hadronic channel; however, event-by-event resolution 

+ A similar approach called "pscudolikcliliood method" was applied 
as a crosscheck in the template- based measurement [104]. 
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information was discarded by using only the peak mass 
instead of the complete likelihood curve. The other 
approaches can function in signal-dominated samples due 
to an important feature of the Breit-Wigner mass shape 
of the top quark (which limits the resolution that can 
be achieved in a single event even by a perfect detector) : 
the slope of its log-likelihood is bounded. Because of this, 
any noise event, no matter how far away from the signal, 
can have only a limited impact on the estimate. In other 
words, if the MEM analysis does not employ the narrow 
width approximation for the top quark, the maximum 
likelihood estimate of Mt is guaranteed to be robust 
and can tolerate certain deficiencies in the background 
modeling. 

5.6. Calibration and statistical uncertainty 

Due to the complexity of the physical processes 
both in the ti production/decay and in the detector 
response, their statistical descriptions invariably involve 
a number of approximations and simplifications. The 
assumptions built into such descriptions, combined 
with the limited amount of available data, result in 
biased point estimators of and/or incorrect interval 
estimator coverage prior to calibration. Fortunately, 
the consequences of such model misspecifications can 
be studied by testing the estimator performance with 
artificial event samples produced by combinations of 
physics Monte Carlo generators and detector simulation 
packages. The problem of the Mt estimator calibration 
is thereby split into two parts which could be analyzed 
independently: derivation of corrections with simulated 
samples and characterization of the differences between 
simulated samples and observed data. While the 
first part of the problem is specific to a particular 
measurement, the second part is studied intensively over 
the whole lifetime of the experiment (about 25 years for 
CDF and D0) which eventually results in a well tuned, 
highly detailed detector response simulation model. 
Therefore, authors of all Mt measurements described in 
this article assume that their detector simulations can 
accurately predict both the average detector response and 
its inherent randomness for the purpose of evaluating the 
bias and the statistical uncertainty of their Mt estimators 
from the simulated samples. 

When the expected statistical uncertainty of the 
measurement is small, it is sufficient to utilize a 
linear approximation to the behavior of a biased point 
estimator, a, as a function of the "true" estimated 
parameter a (which can include, for example, Mt and 
JES): a(a) = a(ao) -|- Jo(a — ao). The Jacobian 
matrix Jq = (||) is evaluated numerically, with 
simulated samples, for the parameter value ao close to 
the expected measurement result. Assuming that Jq is 
not singular, a consistent point estimator is obtained 
by ac = ao + Jo~"'^(a — a(ao)). This transformation 
is often referred to as the "mapping function". As a 
rule, the estimator a is accompanied by an uncalibrated 



estimate of the parameter covariance matrix, (which 
can be obtained, for example, from the Hessian matrix 
of the sample log-likelihood at the maximum). After 
application of the mapping function, this matrix is 
adjusted according to the standard error propagation 
formula: V = Jf^^V^Jo'^'^ ■ Even if the resulting 
V estimate is asymptotically correct, its finite sample 
behavior is typically not well understood. In the 
M ( measurements, the parameter correlation cxxifficicnts 
which can be extracted from V are usually not of 
interest, and the complete calibration of V is not 
performed. Instead, the standard deviations of individual 
parameters are scaled so that the pull* distributions of 
these parameters (with all other parameters eliminated 
by profiling or marginalization) have unit width. These 
pull distributions are obtained with a large number 
of "pseudo-experiments" in which ac is evaluated for 
simulated event samples whose number of entries and 
sample composition is consistent with that expected in 
the data. Resampling techniques are commonly utilized 
to increase the effective number of pseudo-experiments 
without simulating additional events [161]. 

6. Sources of systematic uncertainties 

In addition to the actual event samples used to determine 
the mass of the top quark, the Mt measurement results 
depend on a number of inputs, both theoretical and 
experimental. A number of limitations on the expected 
precision is thus imposed, either due to the finite size of 
data samples which could be used for detector calibration 
or due to the incomplete description of perturbative and 
non-perturbative parts of the relevant QCD processes. 
To characterize such a lack of knowledge, possible 
sources of systematic uncertainties are divided into a 
number of categories which, for all practical purposes, 
can be considered independent. Systematic uncertainty 
contributions from each category are subsequently added 
in quadrature. 

Several different methods are used to evaluate the 
effect of the independent uncertainty sources on Mt 
estimates. The first step usually consists in identifying a 
nuisance parameter associated with a certain particular 
source in the relevant theoretical or detector model. It is 
commonly assumed that this parameter has a Caussian 
prior with a range of variations, a, determined from a 
number of consistency checks between the observations 
and the model. One of the following approaches is 
employed afterwards: 

• The event sample likelihood is evaluated as a 
function of this parameter. The parameter is then 
marginalized or the likelihood is profiled. This 
procedure results in a statistically efficient treatment 
of the uncertainty source under study: the events 
are combined in an optimal manner which takes 

* "Pull" is the difference between the estimated value of a 
parameter and its true value, divided by the estimated standard 
deviation. 
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into account the system response to the nuisance 
parameter variations for the particular kinematic 
configuration encountered. This method is CPU- 
intensive, and so far it has been applied only to 
the most important nuisance parameter in the Mt 
measurement: the detector jet energy scale. 

• Large simulated event samples are produced with 
— 1, 0, and +1(7 changes in the parameter, and 
with the input top mass close to the obtained 
result. Corresponding estimates are obtained: 
MjT, M°, and M^. These estimates are then 
sorted in increasing order. If, after sorting, Af^ lies 
between and then the estimator systematic 
uncertainty associated with this source is taken to 
be |M+ - M~\/2. If, on the other hand, 
ends up outside the interval defined by the 
and endpoints then the uncertainty is evaluated 
as max(|M+ - M0|/2, \M^ - M0|/2). The main 
advantage of this method is its simplicity. It works 
well on average if the Mt estimator can be reasonably 
expected to change linearly as a function of the 
nuisance parameter and if the \M^ — M^\ shift is 
much larger than the statistical uncertainty of the 

or determination. 

• Instead of generating three separate simulated 
samples, an event reweighting scheme is applied to 
the 0(7 sample in such a manner that the effective 
value of the parameter under study is shifted in 
either direction while other parameters are kept 
intact. The changes in the Mt estimator response 
are processed as in the previous method. While 
designing such a reweighting scheme can be a 
complicated problem in itself, this technique can be 
used to study very small systematic shifts because 
the statistical uncertainty of the and 
determination mostly cancels out in the — 
difference. 

It is not always possible to identify a limited set of 
nuisance parameters or to define reasonable ranges of 
their variations. For example, when there are only two 
reasonable hypotheses available for comparison, the full 
extent of the change in the estimator between these 
hypotheses is used as the systematic uncertainty. 

6.1. Jet energy scale 

As the ti signature contains at least two jets in the final 
state and most measurement techniques make explicit use 
of the jet transverse momenta, imperfect calibration of 
the detector jet energy scale has traditionally resulted 
in the single largest systematic uncertainty of the top 
quark mass estimates. In the analysis of Tevatron Run I 
data it was found that 1% change in the jet energy scale 
corresponds, as a rule of thumb, to a 1-1.5 GeV/c^ shift 
in the Mt estimate. Given the 2-3% jet energy calibration 
uncertainties the D0 and CDF collaborations quote, this 
translates into about 3 GeV/c^ limitation on the Mt 
measurement precision [162]. 



Recent Mt measurements have by far surpassed 
this precision by calibrating the jet energy scale in 
situ with the mass of the W boson. Both top quark 
and W boson masses can be simultaneously estimated 
in ti events in which at least one of the Ws decays 
hadronically. If the correlations between these estimates 
are known, a constraint imposed on the W mass estimate 
leads to an improvement in the top mass determination. 
Technically, this idea is more conveniently realized by 
introducing an overall factor, JES, which is used to 
scale four-momenta of all jets in the event sample with 
respect to their reference values obtained with a standard 
calibration [119, 114]. The tt samples accumulated by 
CDF and D0 are sufficient to estimate JES defined in 
this manner with a significantly higher precision than 
that provided by the standard jet calibration. 

Although the automatic adjustment of the overall 
JES factor improves the most important jet-related 
systematic uncertainty, notable differences remain in 
the parton fragmentation and hadronization dynamics 
between light quarks produced in the W boson decays 
and b quarks. Four additional sources of systematic 
uncertainties have been considered to account for these 
differences: 1) the uncertainties on the branching fraction 
of semi-leptonic decays of b and c quarks as measured 
by the LEP experiments; 2) the uncertainties on b 
quark fragmentation parameters [163] computed as the 
difference between the parameters measured by the SLD 
collaboration [164] and the ALEPH, DELPHI, and OPAL 
collaborations [165, 166, 167]; 3) the uncertainty in 
the calorimeter response difference between light and 
heavy flavor quarks; and 4) the calorimeter response 
non-linearity which contributes an uncertainty due to 
different transverse energy spectra of b quarks and W 
decay products. 

In a number of recent Mt measurements [114, 93, 
149, 65], JES is related to the fractional systematic 
uncertainty from prior calibration, a jet, by 

JES=l + /^jES-(Tjet, (8) 

where Aj^s is the underlying nuisance parameter which 
represents the relative jet energy shift in units of pt- 
dependent a jet (see Figures 8 and 9). This formula 
explicitly manifests the assumption that energies of 
jets with different transverse momentum or flavor are 
completely correlated. Subsequent treatment of the 
A,/B5 parameter (e.g., elimination by profile likelihood) 
minimizes the dependence of the Mt estimate on this 
fully correlated component of the jet energy scale 
uncertainty. It is worth mentioning that this treatment 
can be extended so that it takes into account all 
important uncertainties related to jet reconstruction 
in a uniform and consistent manner. The prior 
knowledge of the jet energy systematic uncertainty can 
be quantified not only with a pr-dependent o-jet but 
with a complete covariance function which depends 
on jet pt and fiavor. According to the Karhunen- 
Loeve theorem [168], such a covariance function 
permits a decomposition of the jet energy systematic 
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uncertainty into orthogonal independent components. 
Components corresponding to several largest eigenvalues 
of the covariance function could then be treated as 
measurement nuisance parameters, by in situ calibration 
or by other techniques. Another important advantage of 
this decomposition stems from the simplicity with which 
various jet-related measurements performed with the 
same detector can be cross-calibrated {i.e., component 
constraints obtained in one measurement can be 
immediately applied in another) . We therefore encourage 
derivation and use of such covariance functions (or 
covariance matrices in case the jet pT range is binned) 
for representing jet energy scale uncertainties in future 
measurements of Mt and other jet-related quantities. 

After calibration of an overall JES factor as 
described above, the next step is to evaluate the 
systematic uncertainty due to the uncertainties on 
the individual jet corrections. These arc shown in 
Figures 8 and 9 for CDF and D0, respectively. The 
Pt dependence of the systematic uncertainties on the 
individual corrections arc very different. A "Residual 
JES" systematic uncertainty (see Tables 3 and 4 in 
Section 7.1) is evaluated by adding in quadrature the 
uncertainties for each component. These are obtained 
in the usual manner, i.e., by generating samples with the 
correction shifted by ±lcr to obtain the corresponding 
mass shifts. The procedure described earlier is then 
followed to obtain a final uncertainty. 

6.2. Lepton-related uncertainties 

The response of the detectors to leptons is calibrated 
using J/^* decays and Z — >• events. The uncertainty 
on the overall lepton pT scale in ti events is conservatively 
estimated to be 1%. As in the case of jet energy response 
determination, there are additional effects to consider. 
The muon momentum resolution is found to be better in 
MC than in data. Appropriate extra smciaring is applied 
to the muon pT in the simulated samples in order to 
estimate the corresponding Mt systematic uncertainty. 
Further, for i'+jets and dilcpton channels, the complete 
data acquisition chain is triggered by the presence of a 
charged lepton in the detector. The trigger turn-on curve 
is computed with its uncertainty, and the uncertainty 
is propagated to the Mt estimate. These additional 
Mt variations are small, and they are neglected if their 
magnitude is below 100 MeV/c^. 

6.3. Uncertainties from Monte Carlo generators 

Signal Modeling. CDF evaluates the signal 
modeling systematics by comparing the Mt measurement 
caUbrations obtained with PYTHIA and HERWIG. D0 
uses the difference in reconstructed values obtained 
with their default generator, ALPGEN interfaced to 
PYTHIA, and with PYTHIA proper. Recently, D0 has 
evaluated the uncertainty due to ignoring higher order 
Feynman diagrams for ti production by comparing tt 



samples generated by MC@NLO and by ALPGEN + 
HERWIG [99] (see Table 4 in Section 7.1). 

Color Reconnection (CR). As the top quark and 
W bosons decay quickly on the timescale associated 
with the parton shower and fragmentation processes 
{i.e., l/rt,l/T\Y <C 1/Aqcd), it is possible that 
the decay products from the different top quarks 
interact with each other via color reconnections. Color 
reconnection effects were first investigated at LEP [169] 
as a possible source of systematic uncertainty in precision 
measurements of the W mass in WW events; no evidence 
of color reconnections was foimd in these studies. 
The partons emerging from the Tevatron pp initial 
state carry color charge, thus phenomenological color 
reconnection modeling is significantly more complicated 
than for the e+e~ initial state at LEP. Recently, a new 
version of PYTHIA (v6.4) with improved description of 
parton shower (pT^-ordered) , multiple parton interactions, 
underlying event, and color reconnection (CR) effects has 
been released [170] and subsequently tuned to collider 
data [81]. The CR systematic uncertainty is calculated 
as the difference between the Mt estimators obtained 
with the event samples generated by PYTHIA 6.4 tune 
"A-pro" and PYTHIA 6.4 tune "ACR-pro". These 
tunes differ in the color reconnection model but use 
the same mass-ordered parton shower model as the one 
used in the default PYTHIA 6.2. This approach was 
cross-checked with the pT-ordered parton shower, and 
compatible top mass difference was found. The difference 
between the mass-ordered and the py-ordercd parton 
shower prescriptions is not taken into account in the 
signal systematics, as preliminary versions of the latter 
did not match CDF jet shape evolution [171]. More 
studies of the pT-ordered parton shower model are in 
progress. 

Multi Hadron Interactions (Pile- up). The 

luminosity profile, i.e., the number of interactions 
per bunch crossing, is different in the data and 
in the simulated samples used to calibrate the Mt 
measurements. This disagreement leads to imperfect pile- 
up modeling. To first order, appropriate dependence 
of the jet energy corrections on the number of primary 
vertices (NPV) found in the event cancels this effect out, 
so the corresponding systematic uncertainty is expected 
to be small. Nevertheless, a conservative estimate of 
the uncertainty due to multiple interactions is made by 
studying the dependence of the Mt measurements on 
NVP in the simulated samples as well as by comparing 
the jet response as a function of NPV in the simulated ti 
samples and in minimum bias events. 

Initial and Final State Radiation (ISR/FSR). 
Default Monte Carlo generators used by the CDF and 
D0 collaborations to simulate ti production and decays 
utilize LO matrix element of the process. NLO effects 
appear in the form of additional initial (ISR) and 
final state radiation (FSR) as well as loop corrections. 
To evaluate the systematic uncertainty associated with 
the imprecise radiation modeling, additional tt samples 
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are generated by PYTHIA with increased/decreased 
amount of ISR and FSR. The uncertainty on the 
parameters which control the strength of the ISR 
was estimated by CDF using pp — >• Z'*/7*+jets — >■ 
/x+/z~+jets colHsions [114]. Just as ti events, these 
are produced predominantly via qq annihilation. The 
transverse momentum of the muon-antimuon system, 
^ , depends on the ISR presence in the event. The 

Pj, ^ distribution was used to constrain the generator 
parameters affecting the ISR modeling. As both the 
initial and the final state radiation (FSR) processes 
are subject to DGLAP evolution, the same parameters 
determine the amount of FSR in the generated samples. 
Thus, the ±la changes to these parameters, which are 
used to derive corresponding Mf and M^, affect both 
ISR and FSR simultaneously. 

Parton Distribution Functions (PDFs). The 
PDFs used to model ti production are determined by fit- 
ting multiple observations performed at a number of ep 
and pp collider experiments. Statistical uncertainties of 
these fits, together with the approximations used to de- 
rive the PDF functional representations, contribute to the 
Mt measurement systematic uncertainty. CTEQ5L [82] 
is used as the default PDF set for the CDF Monte Carlo 
generators. This set is compared to the MRST72 [84] set 
in order to estimate the uncertainty arising from the pos- 
sibility of multiple functional representations. The PDF 
sets MRST72 and MRST75 [84] are compared to each 
other in order to estimate the Mt uncertainty due to the 
imprecisely known as value. The CTEQ6M PDF sets 
are produced together with the eigenvectors of the pa- 
rameter covariance matrix {i.e., the 20 principal compo- 
nents) [83]. Both CDF and D0 evaluate the Mt uncer- 
tainty due to the imprecisely known PDF parameters by 
using the CTEQ6M sets in which parameters are shifted 
along these eigenvectors and by combining in quadrature 
the resulting shifts of the Mt estimate. 

Background Modeling. For simulation of the 
VF-|-jets samples, there is an ambiguity associated with 
the choice of factorization and renormalization scales, 
represented in the literature as /i^?, ^n, and/or Q^, 
at which to evaluate the relevant matrix elements. 
A set of additional VK+jcts samples is gcncratod by 
doubling and halving the Q^. The background fraction 
and kinematic distributions are re-evaluated using these 
samples. Corresponding M^ estimator shifts are included 
in the systematic uncertainty of the measurement. 

Background Normalization. A number of 
SM processes involving high mass particles appear 
as backgrounds to tt production, as described in 
Section 3.5. As a rule, backgrounds are generated at LO. 
It was found that background kinematic distributions 
relevant for Mt measurements do not change appreciably 
between LO and NLO simulations. The LO shapes 
of such distributions {e.g., mt in the template-based 
measurements) are therefore normalized to NLO (or 
NNLO if available) theoretical cross sections. The Mt 
systematic uncertainty due to background normalization 



is obtained by studying the response of the Mt estimator 
as these normalizations are varied within their theoretical 
uncertainties. For VK+jcts production, an additional 
uncertainty is contributed by the imprecisely known 
fraction of b jets in the sample. This fraction 
is measured in the W + 1 jet data and then 
extrapolated to the tt event selection together with 
the uncertainties obtained [24, 64]. The QCD multijet 
background kinematics and normalization is evaluated 
using an independent dataset, and the uncertainty on its 
normalization is propagated to the Mt estimate [87]. 

6.4. Other uncertainties in detector modeling 

The D0 collaboration finds a small difference in the 
jet reconstruction efficiency for data and Monte Carlo 
samples. The corresponding uncertainty on Mt was 
found to be very small [99]. 

The consistency in the jet energy resolution between 
data and simulated events has been studied by D0 for 
the 7-1-jet sample. It was found that the resolution is a 
few percent better in the simulation. D0 smears the jet 
energies to match the resolution observed in the 7-l-jet 
events, and then estimates a systematic shift on Mt by 
varying the smearing function within its uncertainty [99] . 
For CDF, this effect is of the order of a few tens of 
MeV/c^ and is thus neglected. 

Top-antitop events are collected by triggering 
data acquisition and recording using a certain set of 
requirements. The trigger efficiency uncertainties are 
small and their effects on Mt measurements are usually 
neglected, as the physics objects requirements in the data 
analysis procedures are chosen to be sufficiently above the 
trigger thresholds. 

The 6-tagging algorithm cfficicincy has been studied 
in QCD bb events. The systematic uncertainty has been 
assigned for the difference between the efficiency found 
in the data and in simulated events. This uncertainty 
has been subsequently propagated to Mt measurements. 
This effect is usually negligible in comparison with other 
systematic uncertainties [99]. 

6.5. Uncertainties from the measurement method 

This category includes all uncertainties stemming from 
the finite size of simulated event samples used to 
calibrate the measurement, such as the uncertainty in 
the parameters of the mapping function. For template- 
based measurements in which templates are represented 
by continuous probability densities, this also includes 
the template parameterization uncertainties estimated 
from a or maximum likelihood fit of the template 
distributions. 

7. Tevatron Run II measurements 

From the first Tevatron collisions in October 1985 and 
until the start-up of LHC 2.36 TeV centcr-of-mass 
collision operations in December 2009, the Tevatron 
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collider at Fermilab was the only place on Earth where 
top quarks were copiously produced. It is thus not 
surprising that until 2011 all direct measurements of 
the top quark mass utilized the data collected by the 
two main Tevatron experiments, CDF and D0. About 
160 pb"^ of integrated luminosity was delivered to these 
experiments during the 1.8 TeV collision operation 
period known as "Run I" (1992-1996). By the time of 
this writing (June 2011), about 11 fb"-'^ of data were 
accumulated at ^Js = 1.96 TeV during the "Run II" 
period which commenced in March 2001 and is still in 
progress. In this section we discuss in greater detail the 
most advanced Tevatron measurements of Mj performed 
so far with the Run II data. 

7.1. Lepton+jets topology 

Measurements performed in the ti £+jets channel, 
£ = e OT n, have traditionally resulted in the most 
precise estimates of the top quark mass at the Tevatron. 
This particular channel is favored by its relatively large 
branching fraction (w 34%) and presence of a lepton and 
missing energy in the final state which allows for efficient 
background suppression. In addition, W boson decays 
into two jets provide an important reference point for 
detector jet energy calibration. 

At the time of this writing, the most precise 
single measurement of is performed with 5.6 fb"^ 
of data collected by the CDF detector during Run 
II of the Fermilab Tevatron [149]. Most of the 
events used in this measurement are collected with high 
transverse momentum lepton triggers that require a well- 
reconstructed electron or muon candidate with px > 
18 GeV/c in the central detector region [172]. The lepton 
Pt cutoff in the sample selection criteria is increased 
to 20 GeV/c in order to avoid the difficult to model 
trigger turn-on curve. The sample also contains a fraction 
of events with loosely idc^ntificxl muons collected with a 
missing transverse energy trigger. 

Exactly four jets with transverse energy Ex > 
20 GeV are required within the pseudorapidity region 
\r]\ < 2.0, with at least one jet tagged as a 6 jet 
using a secondary vertex tagging algorithm. The 
presence of the neutrino in the final state is exploited 
by imposing the requirement fix > 20 GeV. The overall 
tt selection efficiency with these criteria (including the 
£+jcts branching fraction and trigger acceptance) is 
about 2%, while the resulting S/B ratio in the selected 
sample is 3.6. The expected sample composition is shown 
in Table 2. 

The subsequent analysis of the event sample is 
performed by the matrix element method. A highly 
detailed statistical model of the pp tt ^ £+jets signal 
process and of the detector response is developed and 
utilized. The process density in the parton phase space, 
$0j is described by a leading order matrix element which 
includes both qq — > tt and gg — >■ tt production processes, 
as well as tt spin correlations [173]. Lepton momenta are 



Table 2. Expected sample composition for the 
CDF £+}cts measurement with J Cdt = 

5.6 fb"-'^. The tt contribution is estimated 
using a cross section of 7.4 pb and Mt = 
172.5 GeV/c2 [149]. 



Event Type 


1 b Tag 


> 2 6 Tags 


W + Heavy Flavor 


129.5 ±42.1 


15.7 ±5.5 


Non-H" QCD 


50.1 ±25.5 


5.5 ±3.8 


W + Light Flavor Mistag 


48.5 ± 17.1 


1.0 ±0.4 


Diboson {WW, WZ, ZZ) 


10.5 ± 1.1 


1.0 ±0.1 


Single Top 


13.3 ±0.9 


4.0 ±0.4 


Z ^ ££ + jets 


9.9 ± 1.2 


0.8 ±0.1 


Total Background 


261.8 ±60.6 


28.0 ±9.6 


tt Signal 


767.3 ±97.2 


276.5 ±43.0 


Total Expected 


1029 ± 115 


304.5 ±44.1 


Events Observed 


1016 


247 



assumed to be well-measured. The detector response to 
jets is modeled by nonparametric statistical techniques 
as a function of parton transverse momentum and jet 
mass. A dedicated kinematical mapping is used to relate 
the leading order parton-level phase space to a more 
realistic phase space, $i, in which jets can be massive. 
Both angular and transverse momentum jet resolutions 
are represented, and corresponding degrees of freedom 
are integrated over $i together with the matrix element 
evaluated in $o with the aid of the $i $o mapping. 
To improve the convergence rate, the 19-dimensional 
numerical integration is performed by Quasi-Monte Carlo 
(the dimensionality reduction assumptions leading to this 
integral were described in Section 5.4). 

The detector transverse momentum jet response is 
calibrated in situ with W decays by employing a special 
parameterization of the transfer functions: 

Wf{u\y:,AjEs) = gf{u-JES\x) (^JES + u^^^ , (9) 

where u = Px,jet/PT,parton, JES depends on Ajes 
according to (8), and = u, a. = A.jes in relation 
to (6). The reference transfer functions, (3j(u|x), are 
derived separately for light and heavy flavor jets in 
several different pseudorapidity regions from well-tuned 
Monte Carlo detector simulations assuming Ajes = 0. 
The subscript / enumerates jet flavors and rj regions 
used. The transfer functions are normalized by the 
corresponding phase space jet reconstruction efficiencies: 

/Cin(x,A^«s)^/("l^'^^s5)^" = ^fi^^^JEs), where 
the lower integration limit Wmin(x, Aj^g) depends on 
Px,parton ^^d On the effective px.jet cutoff for the 
given Ajes- The efficiencies e/(x, Aj^g) are derived 
from Monte Carlo simulations by nonparametric logistic 
regression. 

The background contamination remaining after 
event selection is estimated on event-by-event basis 
with a neural network discriminant. The network 
classification performance is illustrated in Figure 22. 
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Neural network discriminant 




Neural net output 

Figure 22. Neural network output, q, 
for simulated signal and background samples. 
The network is trained with 10 kinematic 
variables [90]. 

Compared to calculating the background contribution 
into the observed event probability using matrix element 
techniques, this method results in a significantly simpler 
background treatment at the cost of some degradation 
of the Mt estimator statistical uncertainty. The overall 
sample likelihood is adjusted to remove the expected 
background contribution according to 

lnLadj(A/t,Aj£s) =Ej [lnL(yj|Mt,AjBs) 

- /b(gj)lnLb(Mt,Aj£;s) . 

Here, Ladj is the adjusted total likelihood for a given 
set of events, L{yj\Mt,l^jEs) is the signal likelihood 
calculated for each event j, and fhilj) is the background 
fraction for a given event estimated from the neural 
network output: fhiqj) = B{qj)/{S{qj) + B{qj)). 
In this ratio, the B(q) and S{q) distributions are 
normalized to the overall expected background and signal 
fractions, respectively. A simulated signal sample with 
Mt = 170 GeV/c^ is used to define S{q), as shown in 
Figure 22. InL^iMt, Ajes) is the average background 
log-likelihood estimated from a large sample of simulated 
Monte Carlo events. A calibration (mapping function) 
is applied to Ladj in order to remove the remaining bias 
and to ensure proper frequentist coverage of the interval 
estimate. The calibrated adjusted likelihood is shown in 
Figure 23. The Ajes nuisance parameter is eliminated 
by profiling. 

The systematic uncertainty of the CDF measure- 
ment takes into account a number of sources, according 
to the discussion presented in Section 6. The break- 
down of the assigned systematic uncertainty in a set 
of independent components combined in quadrature is 
reproduced in Table 3. The overall result is Mt — 
173.0 ± 0.7 (stat.) ± 0.6 (JES) ± 0.9 (syst.) GeV/c^, with 
a total uncertainty of 1.2 GeV/c^. 

The current most precise D0 measurement of the 
top quark mass employs the matrix element method to 



CDF Run 115.6 fb 




-0.6^ 



170 171 172 173 174 175 , 

m, (GeV/c ) 

Figure 23. Measured 2D likelihood on 
the CDF data events, with the contours 
corresponding to a 1-, 2-, and 3-cr uncertainty 
in the final Mt measurement from the profile 
method. The marker shows the point of 
maximum likelihood. For consistency with 
other results, the sign of Ajgg plotted is 
reversed w.r.t. (8), (9) [149]. 



Table 3. Systematic uncertainties of the 
CDF £-|-jets Mt measurement with 5.6 fb"-'^ of 



data [149]. 


Systematic Source 


Uncertainty (GeV/c^) 


Calibration 


0.10 


JMonte Carlo Generator 


0.37 


ISR and FSR 


0.15 


Residual JES 


0.49 


6- JES 


0.26 


Lepton pt 


0.14 


IVIultiple Hadron Interactions 


0.10 


PDFs 


0.14 


Background IVIodeling 


0.33 


Color Reconnection 


0.37 


Total 


0.88 



analyze 3.6 fb"^ of Run II data in the £+jets channel [99]. 
The presence of one well-reconstructed charged lepton is 
required by the event sample selection criteria, together 
with exactly four high energy jets and substantial missing 
transverse momentum. A neural network is utilized for 
heavy flavor tagging [60]. At least one 5-tagged jet is 
required resulting in about 70% signal content in the final 
sample. 

A parametric model is employed for the jet energy 
transfer functions, while jet angles are assumed to be 
well-measured. The parameterizations are derived for 
a number of pseudorapidity regions separately for light 
jets, b jets, and b jets with a soft muon tag (in the latter 
case, a fraction of the energy of the jets is carried away 
by a neutrino which escapes detection). The jet energy 
scale is included in the transfer functions as an overall 
multiplicative factor, JES, which is not related to the 
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prior systematic uncertainty: 



Wf{E 



jet I E partoii ; 



JES) 



1 



Bf 



E 



JCt 



JES 



E 



parton 



JES 

In this analysis, the reference transfer functions 
£1/ (i? jet I parton) are represented by the double Gaussian 
formula: 

1 



BsiE 



jet I -^parton) 



exp 



2-!r(p2+P3P5) 

(-^jet~-^parton-Pi) 

(-^ jet parton ~P4)^ 
2^1 



+ P3exp - 



These transfer functions are normalized by 



£'/(-^jet I parton) dE ^ct 



1 



for all possible values of i? parton and of parameters pk-, 
fc = 1, 5. The parameters pk are linear functions of the 
parton energy: p^ = + -E parton ■ ^fc- The coefficients 
flfe and bk are determined by fitting the jet response 
in fully simulated Monte Carlo events. It remains 
unclear how faithfully this model represents the energy 
response of low pT jets: as emphasized in [146], transfer 
functions normalized in this manner are unavoidably 
affected by the jet reconstruction inelRciencies which 
alter the reference jet energy distributions. Transfer 
functions are also constructed for the electron energy 
and muon p^^ . The latter quantity is proportional to 
the muon track curvature in the detector magnetic field 
(curvature resolution determines the p^ uncertainty of 
the reconstructed muon). 

A number of assumptions are introduced to reduce 
the dimensionality of the signal phase space integral to 
ten. The following variables are chosen for integration; 
the transverse momenta of the colliding partons (for 
which priors are derived from PYTHIA), the energy 
associated with one of the quarks from the hadronic W 
boson decay, the masses of the two W bosons and the 
two top quarks, and either the energy of the electron or 
1 Ipt of the muon. The signal matrix element models the 
dominant qq — > tt production process [174], while spin 
correlations are ignored. 

The background contribution into the event prob- 
ability is calculated using the W + A jets matrix ele- 
ment provided by VECBOS [175]. It is assumed that 
this contribution alone is sufficient to model the shape 
of the background likelihood, while other backgrounds 
(mainly QCD multijet events) are not treated explicitly. 
The background normalization is adjusted so that the 
correct tt signal fraction can be reproduced in the anal- 
ysis of simulated event samples. The overall event sam- 
ple likelihood is profiled over the signal fraction nuisance 
parameter. JES is marginalized with a Gaussian prior 
consistent with the JES uncertainty from jet energy cal- 
ibrations. 

Systematics uncertainties of the D0 Mt measure- 
ment with Run lib data are shown in Table 4. The 
source breakdown is substantially different from that 



Table 4. Systematic uncertainties of the D0 
£+jcts Mj measurement with 2.6 fb"^ of Run 
lib data [99] . 



Systematic Source 



Uncertainty (GeV/c^) 



(10) 



Higher Order Effects 


0.25 


ISR and FSR 


0.26 


Hadronization and UE 


0.58 


Color Reconnection 


0.28 


Multiple pp Interactions 


0.07 


Background Modeling 


0.16 


W^-l-jets Heavy Flavor 


0.07 


Scale Factor 




&-jet Modeling 


0.09 


PDF 


0.24 


Residual JES 


0.21 


Data-MG Jet Response 


0.28 


Difference 




^-tagging Efficiency 


0.08 


Trigger Efficiency 


0.01 


Lepton Momentum Scale 


0.17 


Jet Energy Resolution 


0.32 


Jet Identification Efficiency 


0.26 


QCD Background 


0.14 


Signal Fraction 


0.10 


MC Calibration 


0.20 


Total 


1.02 



0) 



1.06 




170 172 174 176 178 180 182 

m, GeV 

Figure 24. Calibrated D0 result obtained 
with the Run lib data in the £-|-jets channel. 
The JES prior factor is included into the 
likelihood. The boxes, representing the bins 
in the two-dimensional histograms of the 
likelihoods, have areas proportional to the bin 
contents [99]. 



utilized by CDF. The generator systematic uncertainty, 
here called Higher Order Effects, was calculated compar- 
ing ALPGENv2-hHERWIG with MG@NLO. The over- 
all result (Run Ha and Run lib combined) is Mt — 
174.94 ± 0.83 (stat.) ± 0.78 (JES) ± 0.96 (syst.) GeV/c^, 
with a total uncertainty of 1.49 GeV/c^. The sample like- 
lihood as a function of Mt and JES (labelled by mt and 
kjESj respectively, on the axes) is shown in Figure 24. 
A set of representative Tevatron Mt measurements 
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Table 5. Representative Tevatron Mt measurements in tlie i?+jcts channel. If a reference is marked by (*) then the 
JES calibration uncertainty is included as a part of statistical rather than systematic uncertainty. 





A/Tot Vi^~»rl 


r rdf 


m t 


C Stat 




ixei. 






yi-u j 


CGeV/c^) 


fGeV/c^l 


CGeV/c^) 




CDF (Run I) 


template 


0.02 


174 


10 


+13 
-12 


[7] 


D0 (Run I) 


template 


0.05 


199 


+ 19 

-21 


22 


[6] 


CDF (Run I) 


template 


0.07 


176 


8 


10 


[5] 


CDF (Run I) 


template 


0.11 


176.1 


5.1 


5.3 


[100, 103] 


D0 (Run I) 


template 


0.13 


173.3 


5.6 


5.5 


[101, 104] 


D0 (Run I) 


MEM 


0.13 


180.1 


3.6 


3.9 


[96] 


CDF 


template 


0.16 


179.6 


+6.4 
-6.3 


6.8 


[119] 


CDF 


DLM 


0.16 


177.8 


+4.5 

-5.0 


6.2 


[176] 


D0 


template 


0.16 


170.0 


6.5 


+10.5 
-6.1 


[136] 


D0 


ideogram 


0.16 


177.5 


5.8 


7.1 


[136] 


D0 


template 


0.23 


170.6 


4.2 


6.0 


[177] 


CDF 


DLM 


0.32 


173.2 


+2.6 
-2.4 


3.2 


[116, 157] 


CDF 


template 


0.32 


173.5 


+3.7 
-3.6 


1.3 


[116, 114]* 


D0 


template 


0.32 


169.5 


4.4 


+1.7 
-1.6 


[178]* 


D0 


MEM 


0.37 


170.3 


+4.1 
-4.5 


+1.2 

-1.8 


[97]* 


D0 


ideogram 


0.43 


173.7 


4.4 


+2.1 
-2.0 


[59]* 


CDF 


MEM 


1.0 


170.8 


2.2 


1.4 


[179]* 


CDF 


template 


1.0 


168.9 


2.2 


4.2 


[140] 


D0 


MEM 


1.0 


171.5 


1.8 


1.1 


[98]* 


CDF 


DLM 


1.7 


171.6 


2.0 


1.3 


[158]* 


CDF 


MEM 


1.9 


172.7 


1.2 


1.8 


[148] 


CDF 


template 


1.9 


171.8 


1.9 


1.1 


[93]* 


D0 


MEM 


2.2 


172.2 


1.0 


1.4 


[180] 


CDF 


template 


3.2 


172.2 


1.1 


1.5 


[94] 


CDF 


MEM 


3.2 


172.4 


1.4 


1.3 


[150]* 


D0 


MEM 


3.6 


174.9 


0.8 


1.2 


[99] 


CDF 


MEM 


5.6 


173.0 


0.7 


1.1 


[149] 
[65]* 


CDF 


template 


5.6 


172.2 


1.2 


0.9 



in the £+jets channel is collected in Table 5 which 
illustrates the evolution of analysis techniques and 

attained precision with integrated luminosity. [hbtp] 
Recent CDF and D0 results are in good agreement with 
each other. 

7.2. All-hadronic topology 

The all-hadronic branching fraction of about 46% is the 
largest among all ti decay channels. Due to the absence 
of neutrinos, this final state is kincmatically constrained, 
and in principle, the ti event can be reconstructed. 
The challenges are reduction of the very large QCD 
background due to multijet production and reduction 
of the 90 possible permutations that can be made in 
assigning 2 jets to b quarks and pairing the remaining 
4 jets. 

Dedicated trigger requirements are needed in order 
to enhance the top quark signal contribution over the 
overwhelming background. Even after surviving such 
trigger requirements, the background is about three 
orders of magnitude larger than the signal. In order to 



reconstruct the tt events, at least six high-p^ jets must 
be present in the detector, while charged and neutral 
Icptons are vetoed. Identifying jets originated from h 
quarks helps both in suppressing the QCD background 
and in reducing the number of possible permutations in 
reconstructing the final state kinematics according to the 
tt — )■ W^bW~b — >■ qiq^bq^qj) hypothesis. The distinct 
kinematic properties of ti events are further exploited to 
bring the S/B ratio to an acceptable level. 

Due to the unique challenges, the observation of this 
decay mode was achieved only after ti events were seen 
in the i'+jets and dilepton channels, by analyzing the full 
Tevatron Run I dataset of 110 pb~^. CDF utilized a 
combination of 6-tagging and kinematic/topological cuts 
to select a relatively clean sample of top quark pair 
events subsequently used to estimate Mt [181]. D0 used 
neural networks to exploit the S/B discriminating power 
of several kinematic and topological observables [106]. 
[hbtp] The ti events in the all-hadronic final state were 
observed again during the Run II by both CDF [182, 105, 
87] and D0 [88] collaborations. The new measurements 
of Mt and tt production cross section benefited from the 
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Table 6. Tevatron measurements in 


the all-hadronic channel. 




Experiment 


Method 


/ Cdt 


Mt 


C Stat 


C syst 


Kei. 






(fbi) 


(GeV/c^) 


(GeV/c^) 


(GeV/cP) 




CDF (Run I) 


template 


0.11 


186 


10 


12 


[181] 


D0 (Run I) 


template 


0.11 


178.5 


13.7 


7.7 


[106] 


CDF 


ideogram 


0.31 


177.1 


4.9 


4.7 


[134] 


CDF 


template 


1.0 


174.0 


2.2 


1.8 


[105] 


CDF 


MEM-assisted 


1.0 


171.1 


2.8 


3.2 


[117] 


CDF 


template 


2.9 


174.8 


1.7 


2.0 


[87] 


CDF 


template 


5.8 


172.5 


1.4 


1.4 


[113] 



larger center-of-mass energy rcstilting in ^ 30% higher 
cross section, from improved detectors, and from better 
analysis techniques. For both data taking periods, the 
QCD multijet background is estimated from data; the 
much larger event sample collected in Run II allows for a 
very accurate control of the background modeling. 

The Tevatron publications of Mt measurements in 
the all-hadronic channel are listed in Table 6. So far, 
only the CDF collaboration measured the top quark mass 
with the Run II data. 

The CDF multijet trigger requires four energetic jets 
in the event and large energy deposit in the calorimeter 
in order to select all-hadronic tt decays while suppressing 
the much more common QCD multijet prodtiction. The 
use of a neural network to obtain a cleaner sample of top 
quark hadronic decays resulted in a large improvement 
in the statistical sensitivity of the measurement [105]. 
Similar to the ^-|-jets channel, hadronic decays of the W 
bosons are used to measure the jet energy scale in situ 
with the available dataset. This technique provides an 
important reduction in the systematic uncertainty that 
would otherwise limit the precision of the Mt estimate in 
this channel to ~ 3%. 

The latest and most precise Mt measurement in this 
channel [87, 113] uses several kinematic and topological 
properties of the events as inputs to a neural network 
to separate the signal from the background. It also 
exploits the fact that all-hadronic top quark events 
produce at first approximation only quark jets, while 
the QCD background produces on average several gluon 
jets in the final state. Signal-like quark jets can be 
statistically discriminated from background-like gluon 
jets using calorimeters with sufficient granularity. In fact, 
observed differences in quark and gluon jet widths have 
been previously reported by experiments at the KEK 
e+e" collider (TRISTAN) [183] and the CERN e+e" 
coUider (LEP) [184, 185]. A cut is placed on the neural 
network output shown in Figure 25, and on the quality of 
the fit that assigns the jets to the final state partons when 
the top quark pair decay chain is reconstructed. Both 
cuts were chosen to minimize the statistical uncertainty 
on the top quark mass measurement. The event sample 
is split into two subsamples: with one 6-tagged jet and 
with two or more 6-tagged jets. For the latter subsample. 




Nou. 

Figure 25. Output of the neural network used 
to discriminate tt — > qiq2bq3q4b events from 
QCD multijet production [87]. 

S/B« 1. 

Miiltiple collisions occurring in the same bunch 
crossing often produce extra jets, and thus extra energy 
in the calorimeter. For this reason, as the collider 
instantaneous luminosity increased over time, so did 
the trigger rate. The data collected until the end of 
year 2007 has been obtained with the Tevatron colliding 
particles with an average instantaneous luminosity at the 
beginning of the store of about 10*^^ cm~^s~^. Under 
these conditions, the trigger rate was relatively low and 
the event selection efficiency is about 4%, higher than 
in the ^-|-jets case but with much higher backgrounds. 
Starting from 2008, the instantaneous luminosity at 
the beginning of the store was on average three times 
larger, thus the trigger rates and as a consequence 
the multijet background increased by a factor of 3. 
This was dealt with at the event selection level, by 
tightening the neural network output cut and the cut 
on the kinematic fitter goodness-of-fit. This tighter set 
of cuts provides better reconstructed events at the cost 
of a lower event selection efficiency. The reconstructed 
top quark mass and the reconstructed W boson mass 
distributions can be seen in Figiire 26. Owing to the 
improved analysis techniques, the statistical precision of 

the Mt measurement scaled better than with 1/^J Cdt, 
while the dominant systematic uncertainty due to the 
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Figure 26. Distribution of the reconstructed 
top quark mass (left) and W mass (right) for 
events surviving the event selection. The plots 
show the events with at least one 6-tagged 
jet passing the neural network event selection 
and the goodness-of-fit requirement. These two 
criteria are optimized to reduce the statistical 
uncertainty on the Mj and JES measurements, 
respectively [113]. 



jet energy scale has been linked to l/y J Cdt. The 
most precise determination of the top quark mass in 
the all-hadronic channel as of this writing gives Mt — 
172.5 ± 1.4(stat.) ± 1.0 (JES) ± 1.2(syst.) GeV/c^, or 
Mt = 172.5 ± 2.0 (total) GeV/c^. The ultimate precision 
that can be reached in this channel is now limited by 
1.1 GeV/c^ due to the systematic uncertainties other 
than the overall jet energy scale. It can be appreciated 
from Figure 27 that the measured value of Ajes is 
consistent with the prior expectation (mean value close 
to zero) with the precision a factor of three better than 
the prior constraint. 
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Figure 27. Simultaneous measurement of the 
top quark mass and the jet energy scale in 
the all-hadronic channel. The Aj^s parameter 
is related to the jet energy scale according 
to (8) [113]. 



7.3. Events with taus 

About 35% of decays of the tau lepton are pure leptonic, 
with an electron or a muon and two neutrinos in the final 
state. Charged leptons produced in such decays often 
pass standard electron or muon selection requirements. 
Therefore, the tt event selection criteria in the £-|-jets 
and dilepton channels actually accept a fraction of events 
which contain leptonically decaying taus. On the other 
hand, explicit identification of tau leptons which decay 
hadronically is difficult: these decays resemble common 
hadronic jets originating from quarks and gluons. For this 
reason, hadronic tau identification algorithms typically 
have to operate at low efficiency. For signals with 
many jets in the final state, as the tt decays under 
consideration here, the explicit tau identification is often 
not sufficient to suppress the QCD background in the 
sample. One can instead rely upon the experimental 
signature characterized by multiple jets and large f^T in 
the final state as an efficient way to collect the tt events 
with tau leptons. 

The first measurement of the tt cross section 
which utilized events with large f^T and multiple jets, 
with at least one 6-tagged jet, was performed by the 
CDF collaboration using 310 pb~^ of Tevatron Run II 
data [186] . A more recent CDF study of this experimental 
signature utilized 2.2 fb~^ of Run II data [89]. In both 
measurements, a veto on the presence of reconstructed 
electrons or muons was applied in order to avoid overlap 
with other channels. The more recent study utilized a 
neural network for background rejection achieving a S/B 
ratio of about 4. About 90% of the signal sample was 
composed of tt W^hW~h £v+A jets events, with 
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Table 7. Tcvatron Mt measurements in the 
^T+jets channel (all by CDF). 



/ Cdt 




C Stat 


^ syst 


Ref. 


(fb-i) 


(GcV/c2) 


(GeV/c2) 


(GeV/c2) 




0.3 


172.3 


10.2 


10.8 


[187] 
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[118] 
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Figure 29. Simultaneous measurement of the 
top quark mass and of the jet energy scale in 
the .^y+jets channel. The Aj^g parameter 
is related to the jet energy scale according 
to (8) [118]. 



Figure 28. Distribution of the reconstructed 
top quark mass for events with at least one b- 
tagged jet. The events are required to satisfy 
the neural network event selection and the 
goodness-of-fit criterion [118]. 

^ = T in 40% of the cases. 

In this sample, one of the two top quarks is expected 
to decay into Ivb, with charged lepton not identified. This 
decay provides httle information about the top quarli 
mass. Still, the other top quarli in the event produces 
three jets which can be used to reconstruct its decay chain 
and to measure the jet energy scale in situ. This feature is 
exploited in the recent CDF Mt measurement [118]. The 
kinematic analysis starts by identifying decay products of 
the hadronically decaying W. All pairwise combinations 
of jets without b tags are considered, and the pair which 
gives the invariant mass closest to the world average W 
mass is chosen. Then the jet with the highest Et is 
chosen among the remaining jets and added to the pair in 
order to construct a 3-jet system whose invariant mass is 
correlated with the top quark mass. Another such system 
is built by adding the jet with the second highest Et to 
the pair. 3-dimensional templates are constructed using 
the di-jet and the two tri-jet invariant masses. 

Owing to the large signal acceptance, the good S/B 
ratio, and the capability to constrain in situ the jet energy 
scale uncertainty, this measurement achieves a precision 
superior to that attainable in the dilepton channel. 
The resuh is Mt 172.3 ± 1.8(stat.) ± 1.5 (JES) ± 
1.0 (syst.) GeV/c2 = 172.3 ± 2.6 (total) GeV/c^, in 
agreement with the Mt estimates in other decay modes. 
This result is shown in Figure 29, together with the jet 
energy scale estimate. 

It should be noted that for this channel, just as for 
the £+jets and all-hadronic signatures, the uncertainty 
on the JES shift is much lower than the typical 3% 



uncertainty of the prior calibration. In other words, 
the collective light quark JES measurements obtained 
with about 5,000 tt events in the i+jets, all-hadronic 
and -^T+jets final states, validated the data-to-MC 
correspondence of the overall JES at better than one 
percent level. The Tevatron experiments did not use this 
value in data analyses other than the Mt measurements 
where they were derived, conservatively assuming that 
different jet multiplicities could alter the jet shapes 
and result in a systematic uncertainty different from 
the one quoted here. Still, the Mt measurements in 
these three channels utilize events that range in jet 
multiplicity from four to eight, with compatible results. 
Even though the details of the pt and 77 dependence of 
the light quark jet response remain to be understood 
at this level of precision, the Tevatron results suggest 
that, with the larger dataset available at the LHC, it 
should be possible to calibrate light quark jets using the 
hadronically decaying Ws from tt events with uncertainty 
of 1% or better. 

7.4- Dilepton measurements 

Due to the presence of two charged leptons and two 
neutrinos in the final state, the event samples used to 
measure Mt in the dilepton channel have excellent signal 
purity. Compared to other ti final states, physics effects 
beyond the SM affecting the Mt estimate are easier to 
detect. However, a relatively low branching fraction 
(~ 5%, not counting events with the r lepton) and 
the absence of the hadronic W decay which provides 
a natural reference point for jet energy calibration in 
other channels limit achievable statistical and systematic 
precision of the Mt estimate. The dilepton measurements 
thus complement more precise results obtained in other 
channels. 

At the time of this writing, the dilepton Mt 
measurement with the smallest uncertainty is performed 
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with 5.4 fb"^ of data collected by the D0 detector in 
Tevatron Run II [188]. The events are naturally split 
into subsamples with ee, e/^, or /^/i in the final state, 
with different expected backgrounds. For the ee and /x/x 
final states, the events are selected by a set of single- 
lepton triggers. For the e/i channel, a mix of single and 
multilepton triggers and lepton+jet triggers is utilized. 
The events are required to have two oppositely charged 
isolated leptons with pT > 15 GeV/c, and either \ri\ < 
1.1 or 1.5 < \r]\ < 2.5 for electrons and I77I < 2 for 
muons, ensuring high efficiency and low fake rate for 
lepton identification. At least two jets are required with 
Pt > 20 GeV/c and I77I < 2.5. Background is further 
suppressed by the Ht > 115 GeV requirement in the efj, 
final state and by requiring substantial missing transverse 
momentum in the ee and /Lt/x states. 479 candidate events 
are selected with 73, 266, and 140 events, respectively, 
in the ee, e/x, and /x/x channels, of which about 13 ± 5, 
48 ± 15, and 56 ± 15 events, respectively, are expected to 
arise from background. The sample purity (S/B « 3) is 
already sufficient so that explicit b flavor identiflcation is 
not required for jets. 

The Mt estimate is constructed by the matrix 
element method. In the calculation of event observation 
probability, two contributions are taken into account: 
the qq — )■ ti signal process and -H- Z + 2 jets 
production which is expected to be the dominant source 
of background. The jet energy transfer functions are 
modeled according to (10). Directions of all jets and 
charged leptons, as well as electron energies, are assumed 
to be perfectly measured. The uncertainty of the 
muon track curvature determination is modeled by a 
Gaussian resolution function. Transfer functions are also 
constructed for the fraction of parent energy carried 
by electrons and muons produced in the r — > tugVT 
decays (relevant for the Z — T+r~ decay mode). Under 
the assumption that the direction of the charged lepton 
coincides with the direction of the parent tau and that the 
daughter leptons are massless, the appropriate laboratory 
spectrum is described in [189]. 

The phase space integration for the signal hypothesis 
is performed over the following variables: transverse 
momentum of the ti system with a prior derived from the 
ALPGEN event generator [75], energies of the h quarks, 
the lepton-neutrino invariant masses, the differences 
between neutrino transverse momenta, and the muon 
track curvature {p^^)- 

The background matrix element is calculated using 
VECBOS [175]. The background phase space is sampled 
over the energies of the two partons that produce the jets 
and, in the case of Z ^ t~^t~ decays, over the energy 
fractions of the charged leptons produced in tau decays. 
The Pt probability of the Z + 2 jets system is used as the 
additional weight, together with the leading order matrix 
element, parton distribution functions, and appropriate 
phase space factors. Background normalization is 
adjusted in a manner similar to that employed in the 
D0 Mt measurement in the l+jets channel (Section 7.1). 
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Figure 30. Calibrated and normalized 
likelihood for data as a function of with best 
estimate as well as 68% confidence level region 
marked by the shaded area [188]. 

The likelihood as a function of Mt is obtained by profiling 
over the tt signal fraction. The bias and pulls of the final 
estimate are corrected by a mapping function, and the 
Bayesian credible interval is extracted. The calibrated 
likelihood is shown in Figure 30. 

In this measurement, the largest component 
(2.2 GeV/c^) of the systematic uncertainty is contributed 
by the calibration of the jet response. The overall re- 
suh is Mt = 174.0 ± 1.8(stat.) ± 2.4(syst.) GeV/c^ or 
Mt = 174.0 ± 3.1 (total) GcV/c^. 

The current most precise CDF Mt measurement 
in the dilepton channel is performed by simultaneous 
application of the template method in the ^+jets and 
dilepton final states which allows for in situ calibration 
of the jet energy scale in both channels [65]. 5.6 fb"^ of 
data collected by the CDF detector in Tevatron Run II 
is used, with ^+jets selection criteria similar to those 
described in Section 7.1. The dilepton events are required 
to have two well-identified oppositely charged leptons and 
at least two jets with Et > 15 GeV and \ri\ < 2.5. 
To further reject backgrounds, both missing transverse 
energy and Ht requirements are imposed: ^t > 25 GeV 
and Ht > 200 GeV. 

Two-dimensional templates are constructed for the 
Mt determination by kernel density estimation (KDE). 
The variables used are the event-by-event mt values 
obtained with the neutrino weighting algorithm (j/WT, 
described in Section 5.3) and the transverse mass 771x2- 
Distributions of these quantities in the data and in a 
simulated sample with Mt = 170 GeV/c^ are shown 
in Figure 31. In the ^-|-jets channel, mt values are 
obtained by kinematic fitting based on minimization 
(Section 5.3). Twelve jet-to-parton assignments are 
considered which result in distinct x^- The 7714 values 
from the two fits with the lowest ^re combined 
with mjj (for jets assumed to come from the hadronic 
W decay) from the lowest permutation, and three- 
dimensional templates are built by KDE. As both signal 
and background template shapes depend on the presence 
of 6-tagged jets in the sample, the templates are made 



CONTENTS 



34 



• Data (5.6 fb"^) 




(GeV/c^) 

Figure 31. Distributions of mt values obtained 
with tlie i/WT method (left) and of the 
transverse mass m'p2 (fight) using CDF b- 
tagged events. The data is overlaid with KDE 
templates [65]. 

separately for a number of subsamples: 1-tag and 2-tag 
for ^+jets events (at least one secondary vertex b tag 
is required), non-tagged and tagged for dilepton events. 
The overall likelihood is obtained as a function of Mt 
and AjBs by multiplying together the likelihoods from 
all subsamples. The /S.jes nuisance parameter is then 
eliminated by profiling. Finally, remaining biases and 
pulls are corrected by a mapping function. 

Despite cross-calibration from the £+jets channel, 
the systematic uncertainty of this measurement is still 
dominated by jet response modeling (of h jets in 
particular) which contributes 3.0 GeV/c^. The obtained 
resuh is Mt = 170.3 ± 2.0(stat.) ± 3.1 (syst.) GeV/c^, 
with a total uncertainty of 3.7 GeV/c^. Tevatron Mt 
measurements in the dilepton channel are summarized in 
Table 8. 

7. 5. Measurements which do not use jets 

A number of unconventional top quark mass measure- 
ment ideas were proposed in the recent literature and 
tested on the Run II Tevatron data. These proposals 
aim to reduce the systematic uncertainty (especially its 
part related to the jet energy response calibration), often 
at the cost of a substantial degradation in the statistical 
precision. Even though at this time such methods are not 
competitive with the "traditional" techniques described 
earlier in this section, their special features could make 



them attractive for the analysis of massive tt event sam- 
ples which will be accumulated by the LHC experiments. 
A few of these ideas are described below. 

Mt measurements based on the tracking information 
alone were explored by CDF in [122]. Two variables were 
chosen to build the templates in the £-|-jets channel: Lxy 
which is the transverse distance between the primary and 
secondary vertices projected onto the jet direction and pt 
of the lepton produced in the W — >■ £ff decay. Standalone 
lepton Pt templates were later used by CDF to estimate 
Mt in the £-|-jets [125] and dilepton [124] channels. 

Both Lxy and lepton pT variables are sensitive to 
Mt because W bosons and b quarks produced in the 
t — > Wb decays receive higher transverse boosts from 
heavier top quarks, as illustrated in Figure 17. While the 
sensitivity of such variables to the calorimeter hadronic 
jet energy scale calibration is indeed minimal, reliance 
upon manifestly non-Lorentz invariant quantities has its 
own disadvantages. Both variables are highly sensitive 
to the mismodeling of parton distribution functions 
which affect typical center-of-mass energies and therefore 
average boosts of parent top quarks. The sensitivity to 
the uncertainties in background shape and composition 
is also increased. In addition, the Lxy distributions 
in Monte Carlo are strongly affected by the b jet 
fragmentation model, while the lepton pT measurements 
suffer from imprecise energy scale calibration of the 
electromagnetic calorimeter. On the other hand, the Lxy- 
based estimate is essentially complementary to all other 
methods of top mass determination, and constitutes an 
independent source of information about Mt (the same 
can not be said about lepton px which is used either 
explicitly or implicitly by kinematic fitting and matrix 
element techniques). 

The invariant mass, M^^, of the charged lepton from 
the W decay and the muon from a semileptonic decay 
of the b quark (originated from the same parent top 
as the leptonically decaying W) was used as template 
variable in the CDF study [126]. The efficiency to 
"tag" a tt event passing typical i+]ets event selection 
criteria by finding a soft muon produced in a decay of 
a b quark is about 14% [194]. This leads to a sizeable 
event sample: 248 events were selected in the 2.0 fb"^ of 
CDF Run II data. The selection included events with 3 
jets which resulted in a somewhat higher efficiency but 
lower than usual S/B of about 2. The average Mg^ of 
the selected sample depends approximately linearly on 
Mt, as illustrated in Figure 32. In this analysis, the 
dominant components of the systematic uncertainty are 
due to imprecise knowledge of the background shape 
and fraction in the M^^ template (tibg — 1-9 GeV/c^, 
statistically limited by the size of the available event 
sample) and due to uncertainties in signal modeling, 
especially b jet fragmentation, conservatively estimated 
by comparing Mt estimates obtained for event samples 
generated by HERWIG [69] and PYTHIA [17] (cr, = 
2.1 GeV/c2). 

The Mt estimates obtained by methods which 
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Table 8. Representative Tevatron Mt measurements in the dilepton channel. The "+" sign between the methods 
indicates that the overall result is obtained by combining multiple techniques. "0" joins variables used in multivariate 
templates. 



Experiment 


Method 


JCdt 


Mt 


(T stat 


(7 syst 


Ref. 






(fb-i) 


(GcV/c2) 


(GcV/c2) 


(GeV/c2) 




CDF (Run I) 


i/WT 


0.11 


167.4 


10.3 


4.8 


[102] 


D0 (Run I) 


A^WT + z/WT 


0.13 


168.4 


12.3 


3.6 


[127, 128] 


D0 


TWWT 


0.23 


155 


+ 14 
-13 


7 


[190] 


CDF 


MEM 


0.34 


165.2 


6.1 


3.4 


[191, 151] 


CDF 


DLM 


0.34 


166.6 


+7.3 
-6.7 


3.2 


[159] 


CDF 


j^WT + KIN + PHI 


0.36 


170.1 


6.0 


4.1 


[107] 


D0 


A^WT + z^WT 


0.37 


178.1 


6.7 


4.8 


[110] 


D0 


A^WT + z/WT 


1.0 


174.7 


4.4 


2.0 


[112] 


CDF 


MEM 


1.0 


164.5 


3.9 


3.9 


[152] 


CDF 


KIN + cross section 


1.2 


170.7 


-1-4- 9 

-3.9 


3.5 


[111] 


CDF 




1.9 


172.0 


+3.6 
-3.4 


3.8 


f93l 


CDF 


MEM 


2.0 


171.2 


2.7 


2.9 


[160] 


D0 


MEM 


2.8 


172.9 


3.6 


2.3 


[192] 


CDF 


PHI 


2.9 


165.5 


+3.4 
-3.3 


3.1 


[108] 


CDF 


z^WT ® mT2 


3.4 


169.3 


2.7 


3.2 


[115] 


D0 


MEM 


3.6 


174.8 


3.3 


2.6 


[193] 


D0 




5.3 


173.3 


2.4 


2.1 


[142] 


D0 


MEM 


5.4 


174.0 


1.8 


2.4 


[188] 


CDF 


z/WT ® mT2 


5.6 


170.3 


2.0 


3.1 


[65] 



Table 9. estimates which rely on tracking information (all by CDF). 



Template Channel / Cdt Mt (^giat Csyst Ref- 

quantity (fb"!) (GeV/c^) (GeV/c^) (GeV/c^) 



Lxy 


.^+jets 


0.7 


180.7 


+ 15.5 
-13.4 


8.6 


[121] 


Lepton pt 


dilepton 


1.8 


156 


20 


4.6 


[123] 


Lxy 


e+icts 


1.9 


166.9 


+9.5 
-8.5 


2.9 


[122] 


Lepton Pt 


£+jets 


1.9 


173.5 


+8.8 
-8.9 


3.8 


[122] 




.£+jets 


2.0 


180.5 


12.0 


3.6 


[126] 


Lepton Pt 




2.7 


176.9 


8.0 


2.7 


[125] 


Lepton Pt 


dilepton 


2.8 


154.6 


13.3 


2.3 


[124] 



rely mainly on tracking information are collected in 
Table 9. Potential relevance of these methods for top 
mass measurements with LHC event samples will strongly 
depend on the improvements in the modeling of parton 
distribution functions and b jet fragmentation, as well 
as on the availability of precise detector calibrations for 
lepton Pt- 

7.6. Top-antitop mass difference 

The D0 collaboration was the first to search for a 
violation of the CPT-invariancc in the tt production and 
decay processes by measuring the difference between the 
masses of the t and i quarks [195]. The latest D0 
measurement [196] of A = Mt — Mi is performed with 
an event sample which corresponds to about 3.6 fb"^ 
of Tevatron Run II data. The ^+jets final state is 



selected in which positively (negatively) charged leptons 
are used to tag the t (?) quarks in each event. The 
matrix element technique is utilized to construct the 
overall likelihood as a function of Msum = (Aft + Mt)/2, 
A, JES, and the signal fraction, in a manner similar 
to the D0 Mt measurement described in Section 7.1. 
The signal fraction nuisance parameter is eliminated 
by profiling, while JES is fixed to a constant value 
consistent with the W mass constraint. The resulting 
likelihoods, represented as functions of Mt and Mi, are 
shown in Figure 33 separately for the e+jets and /x+jets 
channels. The final likelihood for A is constructed by 
marginalizing Mgum with a flat prior and subsequently 
combining e+jets and /x+jets channels. 

Compared to an Mt estimate, importance of various 
sources of systematic uncertainties is substantially 
different in the A measurement. For example, the 
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Figure 32. The correlation between the mean 
value of the Af^^ histograms from simulated 
tt and background samples and the input Mt- 
The continuous line shows a linear fit to the 
points [126]. 
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Figure 33. Sample likelihoods as a function 
of Mt and for (a) e+jets and (b) /^+jets 
D0 event samples. The boxes, representing 
the bins in the two-dimensional histograms of 
the likelihoods, have areas proportional to the 
bin contents. The solid, dashed, and dash- 
dotted lines represent 1, 2, and 3 standard 
deviation contours of two-dimensional Gaussian 
fits before pull corrections are applied [196]. 



jet energy scale-related uncertainty of a few percent 
becomes relatively insignificant, while effects which can 
have different impact upon reconstruction of t vs. t 
decay products become of major concern. Such effects 
include the mismeasurement of the lepton charge and 
uncertainties from modeling differences in the response 
of the calorimeter to b and b jets (most notably, 
a different content of K'^/K^ mesons which have 
different interaction cross sections with the calorimeter 
material). Taking these considerations into account, 
the D0 collaboration obtains an estimate A — 0.8 ± 
1.8 (stat.)±0.5 (syst.) GeV/c^, consistent with Mt = Mi. 

The CDF measurement of A utilized 5.6 fb"^ of 
Run II data [197]. This measurement is also performed 
in the ^-j-jets channel, with the sample selection criteria 



similar to those used for the CDF Mt measurement 
described in Section 7.1. Events without b tags are 
admitted as well, with an additional requirement > 
250 GeV. The template approach is utilized. The 
event-by-event estimate of A, Amj, is obtained from a 
kinematic fit. is evaluated according to expression 
(2) in which the two terms that include mt are replaced 

by {Mmu - (M,^^ - dmreco/2))^/T^t + (Mbjj - [Msum + 

dmreco/'i)f 1^1- Msum is fixcd at 172.5 GeV/c^. Amt 
is subsequently defined by Amt — —Qt ■ dm™^^, where 
Qi is the sign of the lepton charge which distinguishes t 
and i, and dm^^^^ is the value of dmreco that minimizes 
the x^- Two-dimensional templates are constructed using 
the two Amt values which correspond to the best and 
second best (over distinct jet-to-parton assignments and 
kinematic solutions) values of x^, with input A varied 
between —20 and 20 GeV/c^. To improve the template 
modeling, the event sample is split into six subsamples 
with zero, one, or two b tags and different values of Qi. 
Templates are subsequently fitted to the distributions of 
best and second best Amt observed in the data. To 
determine A, log-likelihoods from all event subsamples 
are added and the statistical uncertainty is scaled by 1.04 
to ensure proper frequentist coverage (the bias is found 
to be consistent with zero). 

As in the D0 measurement, the dominant part of 
the systematic uncertainty is due to signal mismodeling. 
CDF obtains A = -3.3 ± 1.4 (stat.) ± 1.0 (syst.) GeV/c^, 
which is consistent with Mt = at the about 2a level. 

7. 7. Mass from cross section 

The theoretical ti cross section prediction depends on the 
top mass, as shown in Figure 4 in Section 2.1. Therefore, 
given a theory curve and a cross section measurement, 
it is possible to extract a top mass estimate. This has 
been done by the D0 collaboration using several cross 
section measurements: in the dilepton channel [198], 
using a combination of different channels [199], and, 
most recently, in the £-|-jets channel [13]. The latter 
measurement, compared with two theoretical cross 
section calculations, is shown in Figure 34. 

The measured value of the cross section is positioned 
at Mt = 172.5 GeV/c^, the mass value used to 
calculate the acceptance for the measurement. The 
gray band depicts the total experimental uncertainty 
plotted as a function of the top mass. The appoximate 
NNLO calculations are from Ref [11], the NLO-^NNLL 
calculations are from Ref [200] . The uncertainties in the 
theoretical calculations are shown by the dotted lines. 
The analysis extracting the top mass estimate from these 
cross section approximations makes the assumption that 
Mmc — Mpoic (see Section 1.1). The results are: 

Mpoic = 163.0t|:o GeV/c2 for NLO+NNLL 
A/poic = 167.5t|^ GeV/c2 for approx. NNLO 



The value obtained with the approximate NNLO 
calculation is closer to the top mass measured directly 
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Figure 34. Top cross section vs. top quark 
mass as calculated by different authors [11, 200]. 
The data point is the D0 measurement [13] 
mentioned in the text. 

but the uncertainties are large in both cases. D0 has 
also extracted a value for the top quark mass in the MS 
renormalization scheme, as mentioned in Section 1.1. 

8. Combination 

While all individual Mt measurements provide important 
information per se, a more precise estimate can still be 
obtained by combining the results. Such a combination 
is regularly performed by the Tevatron Electroweak 
Working Group (TEVEWWG) [201]. Normally, the 
current most precise measurement in each decay 
mode per experiment is selected to enter into the 
Tevatron combination. Utilizing measurements which 
use overlapping datasets requires the estimation of 
the statistical correlations among the measurements 
themselves. JVIany systematic uncertainties are correlated 
among channels as their sources affect either the modeling 
of the signal and/or background events or the calibration 
of the detector response to the final state objects. 
These correlations have to be understood in detail in 
order to combine the measurements properly. This 
necessitates a collaborative effort by the authors of the 
Mt measurements that are being combined. 

8.1. Method and general issues 

The CDF and D0 collaborations use the best linear 
unbiased estimator (BLUE) method [144, 145] to combine 
their measurements. The collaborations have been 
working jointly for a number of years with the purpose of 
understanding the sources of the systematic uncertainty 
and their interdependence. Once an independent 
uncertainty source is identified, its impact on the Mt 
measurements is evaluated by assigning a correlation 
matrix which has an entry for each pair of results. Before 



the final combination is performed, systematic sources are 
collected into groups that share similar physical origins 
and whose impact on the measurements is described by 
the same correlation matrix. This grouping is described 
below: 

Statistics. The statistical uncertainty associated with 
the Mt determination. 

iJES: Part of the JES uncertainty which originates from 
in situ calibration procedures that use the W boson mass 
in decay modes with at least one hadronically decaying 
W. 

aJES: Part of the JES uncertainty which originates 
from the difference in calorimeter electromagnetic over 
hadronic (e/h) response for b jets and light-quark jets. 

bJES: Part of the JES uncertainty which originates 
from generator modeling of b jets. For both CDF and 
D0, it includes the uncertainties arising from variations 
in the semileptonic branching fractions, b fragmentation 
functions, and differences in the color flow between b jets 
and light-quark jets. 

cJES: Part of the JES uncertainty which originates from 
the modeling of light-quark fragmentation and out-of- 
cone corrections. For D0 Run II measurements, it is 
included in the dJES category. 

dJES: Part of the JES uncertainty which originates 
from the finite size of the data samples used to 
calibrate the jet energy response of the detectors. This 
includes uncertainties associated with the ry-dependent 
JES corrections which are estimated using dijet data 
events. For D0 this also includes the uncertainties in 
the light jet response and uncertainties that arise due to 
the sample dependence of jet corrections derived with the 
7-l-jets data. 

rJES: The remaining part of the JES uncertainty. 
For CDF, this is dominated by uncertainties in the 
calorimeter response to light-quark jets, and also 
includes small uncertainties associated with the multiple 
interaction and underlying event corrections. For D0 
Run II measurements, uncertainty sources of this kind 
belong to the dJES category. 

LepPt: The systematic uncertainty related to the cali- 
bration of lepton transverse momentum measurements. 

Signal: The systematic uncertainty arising from 
uncertainties in the tt modeling. This is obtained 
summing in quadrature several different sources. It 
includes uncertainties in the ISR and FSR descriptions; 
the uncertainty on the knowledge of the PDFs; the 
systematic uncertainty arising from a variation of 
the phenomenological description of color reconnection 
between final state particles. The systematic uncertainty 
associated with variations of the physics model used to 
calibrate the data analysis methods, correlated across 
all measurements. It includes variations observed when 
ISAJET (Run I) or HERWIG or ALPGEN-hPYTHIA 
are substituted to PYTHIA in modeling the tt signal. 
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For D0 it also includes the uncertainty that arises from 
considering or ignoring higher order Feynman diagrams. 

Detector modeling (DetMod): Uncertainty in the 

modeling of the detector in the MC simulation, such 
as the uncertainty on the jet energy resolution and 
identification efficiency. 

Background from MC (BGMC): Uncertainty in the 

background modeling, including sample composition and 
shape of relevant distributions, including the choice of the 
factorization scale used to model W^+jets production. 

Background from data (BGData): This group 
includes uncertainties associated with the modeling of the 
QCD multijet background in the all-hadronic, ^r+jets 
and f+jets channels, uncertainties associated with the 
modeling of the Drell-Yan background in the dilepton 
channel 

Method: The systematic uncertainty arising from any 
source specific to a particular data analysis method, 
including the finite Monte Carlo statistics available for 
method calibration. 

Uranium Noise and Multiple Interactions (UN/MI) 

This is specific to D0 and includes the uncertainty aris- 
ing from uranium noise in the D0 calorimeter and from 
the multiple interaction corrections to the JES. For D0 
Run I these uncertainties were sizable, while for Run II, 
owing to the shorter calorimeter electronics integration 
time and in situ JES calibration, these uncertainties are 
negligible. 

Multiple Hadron Interactions (MHI): The system- 
atic uncertainty arising from the mismodeling of the dis- 
tribution of the number of collisions per Tevatron bunch 
crossing. 

Reasonable variations in the assignment of uncer- 
tainty sources to these groups, in the back-propagation 
of the bJES uncertainties to Run I measurements, and in 
the assumed magnitude of the correlations have produced 
a negligible impact on the combined measurement 
and its total uncertainty. 

8.2. Latest combination 

The latest Tevatron combination [202] utilizes twelve 
different measurements of Mt. These are the Run I 
and Run II CDF measurements in the ^-|-jets (l-|-j in the 
summary table) [100, 149], all-hadronic (all-j) [181, 113], 
and dilepton (di-1) [102, 203] channels, the D0 Run I 
and Run II measurements in the £-|-jets [104, 204] and 
dilepton [127, 193] channels, one CDF measurement in 
the ^-|-jets channel that used tracking information with 
minimal dependence on the jet response (trk) [122], 
and one CDF measurement performed in the ^r+jets 
signature [118]. All these measurements are summarized 
in Table 10 together with their uncertainties. 

All Run I measurements have relatively large 
statistical uncertainties due to the limited size of the 
tt samples collected. Their systematic uncertainties are 



dominated by the total jet energy scale (JES) uncertainty. 
In Run II, both CDF and D0 take advantage of the larger 
tt samples available and employ new analysis techniques 
to reduce both of these uncertainties. 

The statistical uncertainty is uncorrelated among the 
measurements due to the non-overlapping event samples 
used. The only possible exception to this rule is the CDF 
Run II Mt measurements in the £+jets channel named 
"1+j" and "trk" in the Table 10 caption. The latter 
employs a subset of the data sample used by the former. 
The statistical correlation between the trk analysis and 
an older Run II CDF 1+j measurement was studied using 
Monte Carlo signal-plus-background pseudo-experiments 
which correctly accounted for the sample overlap and 
was foimd to be consistent with zero. The statistical 
part of the JES systematic uncertainty (iJES) is also 
uncorrelated, for the same reasons. The correlation 
matrices for the statistical uncertainty and for the iJES 
category are thus the trivial unit matrices. Unit matrix 
is also used to represent correlations for the Method 
category. 

The correlation among the measurements for the 
:bJES, cJES, and Signal uncertainties is taken to be 
100%. The uncertainties in the aJES, dJES, LepPt, MHI, 
DetMod and BGData categories are taken to be 100% 
correlated among all Run I and all Run II measurements 
within the same experiment, but uncorrelated between 
Run I and Run II, and uncorrelated between the 
experiments. The uncertainties in the rJES and 
UN/MI categories arc taken to be 100% correlated 
among all measurements within the same experiment but 
uncorrelated between the experiments. The uncertainty 
in the BGMC category is taken to be 100% correlated 
among all measurements in the same channel and among 
experiments. 

The combined value of the top quark mass is Mt = 
173.18 ± 0.56 (stat.) ± 0.75 (syst.) GeV/c^. The value 
of this result is 8.3 for 11 degrees of freedom. The 
corresponding confidence level is 68%, indicating good 
agreement among the input measurements. Adding the 
statistical and systematic uncertainties in quadrature 
yields a total uncertainty of 0.94 GeV/c^, corresponding 
to a relative precision of 0.54% on the top quark mass. 
Rounded off to two significant digits in the uncertainty, 
the combined result is Mt = 173.2 ± 0.9 GeV/c^. 

The input measurements and the current combina- 
tion are displayed in Figure 35. The most recent CDF 
and D0 measurements in the ^-|-jets channel which use 
the matrix element method enter the combination with 
the largest weights, followed by the CDF template mea- 
surement in the all-hadronic channel. 

It can be deduced from Table 10 that the total JES- 
related uncertainty is 0.49 GeV/c^ with 0.39 GeV/c^ 
coming from the statistical component and 0.30 GeV/c^ 
from other sources. As the most significant fraction of 
the JES uncertainty is still statistical in nature, the Mt 
precision can be further improved by simply analyzing 
larger collision datasets already collected by CDF and 
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Table 10. Summary of the measurements used to determine the Tcvatron average Mt- Integrated luminosity 
(J C di) has units in fb^^, and all other numbers are in GeV/c^. The unecrtainty categories and their correlations 
are described in the Sec. 8.1. The total systematic uncertainty and the total uncertainty are obtained by adding the 
relevant contributions in quadrature, "n/a" stands for "not applicable" , "n/e" for "not evaluated". 







Run I published 






Run II published 




Run II prel. 
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D0 






CDF 




D0 


CDF 
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allh 


1+jt 


di-1 


1+jt 


di-1 


di-1 


Lxy 


1+jt 


di-1 


1+jt 


allh 


MEt 


all 


j£dt 


0.1 


0.1 


0.1 


0.1 


0.1 


5.6 


1.9 


5.6 


5.2 


3.6 


5.8 


5.7 


< 5.8 


Result 


186.0 


176.1 


167.4 


180.1 


168.4 


170.28 
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173.00 


173.97 
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172.47 


172.32 


173.18 


iJES 


n/a 
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n/a 


n/a 


n/a 


n/a 


n/a 


0.58 


n/a 


0.53 


0.95 


1.54 


0.39 


aJES 


n/a 


n/a 
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0.0 


0.0 


0.14 


n/a 


0.13 


1.57 


0.0 


0.03 


0.12 


0.09 


bJES 


0.6 


0.6 


0.8 


0.7 


0.7 


0.33 


n/a 


0.23 


0.40 


0.07 


0.15 


0.26 


0.15 


cJES 


3.0 


2.7 


2.6 


2.0 


2.0 


2.13 


0.36 


0.27 


n/a 


n/a 


0.24 


0.20 


0.05 


dJES 


0.3 


0.7 


0.6 


n/a 


n/a 


0.58 


0.06 


0.01 


1.50 


0.63 


0.04 


0.05 


0.20 


rJES 


4.0 


3.4 


2.7 


2.5 


1.1 


2.01 


0.24 


0.41 
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0.38 


0.45 


0.12 
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n/e 


n/e 


n/e 


0.27 


n/a 


0.14 


0.49 


0.18 






0.10 


Signal 


2.0 


2.6 


2.9 


1.1 


1.8 


0.73 


0.90 


0.56 


0.74 


0.77 


0.62 


0.74 


0.51 


DctMod 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.33 


0.36 


0.0 


0.0 


0.10 


UN/MI 


n/a 


n/a 
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1.3 


1.3 


n/a 
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n/a 
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n/a 


n/a 
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0.24 
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0.27 
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0.18 


0.0 
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0.14 


BGData 


0.0 
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0.14 


0.20 


0.06 


0.47 


0.23 


0.56 


0.12 


0.11 


Method 


0.6 
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0.7 


0.6 


1.1 


0.12 


2.50 


0.10 


0.10 


0.16 


0.38 


0.14 


0.09 


MHI 


n/e 


n/e 


n/e 


n/e 


n/e 


0.23 


0.0 


0.10 


0.0 


0.05 


0.08 


0.16 


0.08 


Syst 


5.7 


5.3 


4.9 


3.9 


3.6 


3.13 


2.82 


1.06 


2.45 


1.24 


1.40 


1.82 


0.75 


Stat 


10.0 


5.1 


10.3 


3.6 


12.3 


1.95 


9.00 


0.65 


1.83 


0.83 


1.43 


1.80 


0.56 


Total 


11.5 


7.3 


11.4 


5.3 


12.8 


3.69 


9.43 


1.23 


3.06 


1.50 


2.00 


2.56 


0.94 



D0. 

When the complete dataset produced by the 
Tevatron will be analyzed, the total uncertainty of the 
Mt determination will likely drop around 0.8 GeV/c^, 
even without further understanding of the systematic 
sources. By the time Tevatron operations will close at 
the end of 2011, LHC will have produced much larger tt 
samples, so the expected Mt statistical uncertainty will 
soon be negligible. It will be important to establish the 
correlations among the measurements performed at the 
Tevatron and at the LHC so that one single estimate of 
the top quark mass can still be provided to the particle 
physics community. 

9. Conclusions 

From the first direct observation of top quark pair 
production and throughout the whole lifetime of the 
Tevatron collider program, precision determination of 
the top quark mass has been the subject of intensive 
studies at the CDF and D0 experiments. A number 
of sophisticated Mt measurement techniques have been 
developed. Important methodological advances include 
the first practical application of the matrix element 



analysis method to hadron collider data and introduction 

of the in situ jet energy scale calibration with the W 
boson mass. These particular improvements led to a 
substantial reduction in the statistical and systematic 
uncertainty, respectively. Recent Mt measurements 
performed in different final states of the ti system, 
with multiple analysis methods and with independent 
experimental setups arc all in close agreement with each 
other, in accordance with Standard Model expectations. 

The current combined estimate of the top quark 
mass by CDF and D0 is 

Mt = 173.18 ± 0.56 (stat.) ± 0.75 (syst.) GeV/c^ 
= 173.18 ± 0.94 (total) GcV/c^ 

Its accuracy is limited by systematic uncertainties result- 
ing from a number of imprecisely known calibration pa- 
rameters, both theoretical and experimental. Although 
the knowledge of some of these parameters will auto- 
matically improve as more data becomes available, fur- 
ther reduction of the uncertainty will require more de- 
tailed understanding of the underlying physics processes. 
Whenever possible, this understanding should be trans- 
lated into the calculation of the corresponding systematic 
uncertainty on event-by-event basis. Modeling of parton 
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Figure 35. Summary of the input measure- 
ments and resulting Tevatron average mass of 
the top quark [202] . 

showering and hadronization, together with accurate rep- 
resentation of nonhncarities in the detector jet response, 
will remain critical for CDF and D0. 

In the future, when much larger tt samples are 
accumulated by the LHC experiments, Mt measurements 
which do not use jets could outperform more conventional 
techniques. The ultimate precision will be reached 
by scanning the ti center-of-mass energy production 
threshold at a high energy lepton collider. 
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